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Abstract. In this essay, we discuss the notion of optimal transport on (geodesic) metric 
spaces, and the associated (2-)Wasserstein distance. We then examine displacement convexity 
of the entropy functional on V(X) and associated synthetic Ricci lower bounds. In particular, 
we prove that the Lott-Villani- Sturm notion of generalized Ric > K agrees with the classical 
notion of Ric > K on smooth manifolds and that it is preserved under Gromov-Hausdorff 
convergence. We examine in detail the space of probability measures on the unit interval [0, 1], 
denoted Vo, equipped with the (2-)Wasserstein metric. We show that it is isometric to the space 
of nonnegative, nondecreasing functions on [0, 1] with the metric induced from the L 2 norm, 
and using this we show that (Vo,d w ) has vanishing Alexandrov curvature, and give a direct 
proof that the entropy functional is displacement convex on Vo- We additionally examine finite 
dimensional Gromov-Hausdorff aproximations of Qo, and use these to construct the entropic 
measure on Qo first considered by Von Renesse and Sturm, denoted Qq . In addition, we examine 
properties of Qg and explain why one might expect that it has generalized lower Ricci bounds, 
which we show not to be the case. Finally, we discuss the possibility of finding a measure 
M 6 P(Vo) such that (Vo,d w ,M) has generalized Ricci lower bounds. 



Contents 

1. Introduction 

2. Geodesic Metric Spaces 

3. Optimal Transport and Wasserstein Distance 

3.1. Metric and Topological Properties of Wasserstein Distance 

3.2. Dynamical Transport Plans and Geodesies 

4. Metric Structure of (Vo, d w ) 

5. Metric Sectional Curvature 

5.1. Toponogov's Theorem and Alexandrov Curvature 

5.2. Alexandrov Sectional Curvature of Wasserstein Spaces 

6. Metric Ricci Curvature 

6.1. Displacement convexity in Vq 

6.2. Optimal Transport and Displacement Convexity in Riemannian Manifolds 

7. Gromov-Hausdorff Convergence 

7.1. Definition and Basic Properties 

7.2. Stability of Weak Displacement Convexity 

7.3. Approximations of Vo 

8. Measures on Vo and Associated Ricci Curvature Bounds 

8.1. Analytic Consequences 

8.2. Failure of Ricci Bounds for the Entropic Measure 

8.3. Other Measures 

9. Conclusion 



2 
5 
6 
7 
9 
11 
13 
13 
15 
16 
17 
18 
22 
22 
24 
27 
29 
29 
30 
37 
38 



Date: January 21, 2013. 

This document was written in fulfillment of the essay option of Part III of the Mathematical Tripos at 
Cambridge University for the 2010-2011 academic year. I would like to thank Dr. Clement Mouhot for agreeing 
to set and mark this essay, assisting me in learning the material contained within, his extensive editing help, 
as well as for suggesting the problem of Ricci bounds on (V(X),d w ). All of the figures are created using the 
TikZ package and I would like to thank Thomas Trzeciak for his spherical drawing code that I modified to create 
Figure|4l which can be found at http: //www. latex- community . org/f orum/viewtopic .php?f =4&t=2111 Finally, 
I would like to thank the Cambridge Gates Trust for their financial support this year. 

1 



2 



OTIS CHODOSH 



Appendix A. Nonsmooth Change of Variables 3S 

Appendix B. A Brief Overview of Riemannian Geometry and Curvature 41 

B.l. Riemannian Metrics and Connections 41 

B.2. Curvature 42 

B. 3. Jacobi Fields 42 
Appendix C. Manifolds with Lower Ricci Bounds 43 

C. l. Topological and Geometric Consequences 44 
C.2. Gromov-Hausdorff Limits of Bounded Ricci Manifolds 44 
C.3. Upper Ricci Bounds 45 
References 46 



1. Introduction 

Lott and Villani in [26] and independently Sturm in |32} [33| [M] have developed a synthetic 
notion of lower Ricci bounds on geodesic metric spaces by considering displacement convexity 
of certain entropy functionals along geodesies in the space of probability measures on the un- 
derlying metric space. Their notion agrees with the classical condition of lower Ricci bounds 
on Riemannian manifolds, and is preserved under Gromov-Hausdorff convergence. In addition, 
Lott- Villani have used this notion to examine the relation between this displacement convexity 
and log Sobolev, Poincare, and Talagrand inequalities. In this essay, we give an overview of this 
notion, discussing Wasserstein distance on the space of probability measures on a metric space, 
as well as proving the agreement of the synthetic Ricci bounds and classical Ricci bounds on 
Riemannian manifolds and the preservation of synthetic bounds under Gromov-Hausdorff con- 
vergence. We also give a detailed examination of the space of probability measures on the unit 
interval Vo '■= ^([0, 1]), equipped with the Wasserstein metric. In this case, the one dimension- 
ality of [0, 1] allows us to give more explicit formulas for the dynamics of optimal transport, and 
provides some insight into the more general case. We also examine the analytic consequences 
of a measure on Vq with lower bounds on Ricci in the Lott-Villani-Sturm sense, and show that 
the entropic measure constructed by Von Renesse and Sturm in [38] does not admit lower Ricci 
bounds, even though there are good reasons to suspect that it does. 

The structure of this essay is as follows. In Section [2j we discuss the notion of geodesic spaces, 
which are metric spaces where the distance between two points is the same as the minimum 
of the lengths of all continuous paths between the two points. We will call curves achieving 
this minimum geodesies. Then, in Section we examine optimal transport and Wasserstein 
distance. In particular for a compact metric space (X, d) , this will allow us to define a metric 
on V{X), the space of probability measures on X by 

d W ((i, v) 2 : = inf / d(x,y) 2 dTr(x,y) 

where II(/i, v) is the set of transport plans, i.e. probability measures on X x X with first and 
second marginals fi and v, respectively. We show that this is a metric, turning 'P(X) into a 
compact metric space, with the weak* topology. Finally, if (X, d) is a geodesic space, then we 
will show that (V(X),d w ) is as well and we will discuss geodesies in this space. In Section [H 
we discuss the space of probability measures on the unit interval, "P([0, 1]), which we view as 
a metric space with the Wasserstein metric. We show that this metric space, which we denote 
(Vo,d w ) is isometric to (Go,d L ) where Go C L 2 ([0, 1]) is the set of nonnegative, nondecreasing 
right continuous functions on [0, 1] with f(l) = 1 and d L is the metric induced by L 2 distance. 
This isometry (which we denote is given by the inverse distribution map in one direction, and 
the pushforward map applied to Lebesgue measure in the other direction. The isometry ^ will 
be very important to our understanding of Vo, because it allows us to work with Go, which is a 
totally geodesic, compact subset of a Hilbert space, and thus has reasonably simple properties. 
For example, we show that geodesies are unique, and in fact are straight lines between their 
endpoints. 
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In Section we describe a synthetic notion of sectional curvature. We say that a geodesic 
space has nonnegative sectional curvature in the Alexandrov sense if geodesic triangles in the 
space are "fatter" than corresponding triangles in the plane, M?. In particular, we state To- 
ponogov's theorem, which says that this is equivalent to nonnegative sectional curvature in 
the classical sense. Then, we go on to discuss the relation between the sectional curvature of 
M and Alexandrov curvature V(M) for M a Riemannian manifold. As a special case of this 
consideration, we prove that that (Vo,d w ) has "vanishing Alexandrov curvature." In Section 
El we define the entropy functional on V(X) by 



Ent(/x|rn 



{J x p log p dm for p <C m and p = pm 
+00 otherwise 



which allows us to discuss displacement convexity for geodesic measure spaces (X,d,m). We 
define the notion of weak a.c. displacement convexity (cf. Definition 16. ip . which is the condition 
that for po,pi £ V(X) with pi <C m, then there exists some geodesic in V(X) between them, 
pt-, such that 

K 

Ent(p t \m) < tEnt(pi\m) + (1 - t)Ent(p \m) - —t(l - t)d w (p , pi) 2 . 

We then prove that for a compact Riemannian manifold, choosing the reference measure to be 
the normalized volume measure, i.e. m = v ^°*jj^ , the triple (M,d,m) has lower Ricci bounds 
Ric > K if and only if Ent(-|m) is weakly a.c. displacement if-convex. In proving this, we 
discuss particulars about optimal transport on manifolds. Because of this result, we will often 
refer to weak a.c. displacement i^-convexity as generalized Ric > K. Additionally in this 
section, we explicitly investigate displacement convexity over the interval, proving directly that 



Ent(*(/)|Leb) = - / log f(x)dx. 
Jo 





for f £ Go (i.e. a nondecreasing function on [0, 1]) where ^f(f) = /*Leb is the image of / under 
the isometry ^ : (Go, d L ) — > (Vq, d w ). By concavity of log, this allows us to directly prove that 
Ent(-|Leb) is displacement 0-convex in this case. 

In Section [71 we define the notion of Gromov-Hausdorff convergence of metric measure spaces 
and give an overview of the proof that weak a.c. displacement convexity is preserved under this 
type of convergence. We say that a sequence of (compact) metric measure spaces (X n , d n , p n ) 
converges in the (measured) Gromov-Hausdorff topology to (X,d,p) if there is a sequence of 
e n > 0, with e n — > and associated maps f n : X n — > X (not necessarily continuous) which are 
e n -isometries, i.e. 

(1) forx,x'G X, \d x (x,x') - d Y {f n (x), f n (x'))\ <e 

(2) for all y G Y there is x £ X with dy (y, f n {x)) < e 

and furthermore, that (f n )*Pn — > P in the weak* topology. We then show that generalized 
Ricci lower bounds are preserved under such convergence. We discuss the relation of this 
with Gromov's compactness theorem, which says that the set of Riemannian manifolds with 
a fixed dimension, diameter upper bound and Ricci curvature lower bound is precompact in 
this topology. Finally, we show how to approximate the space (Go,d L ) by finite dimensional 
metric spaces. Then, in Section [HI we discuss reference measures on (Go,d L ) (or equivalently 
on (Vo,d w ), but because Go is a totally geodesic subset of a Hilbert space, it is often easier to 
work with). First, we discuss the analytic consequences of such a measure having generalized 
Ricci lower bounds, in particular discussing the resulting log-Sobolev and Poincare inequalities. 
Then, we will discuss the entropic measure Qq, on Go, constructed by Von Renesse and Sturm 
in [38] , defined by requiring that 

/ i*(5(*i),...,5(tjv))dQo 
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= T-rjV 777 / U fa, ■ ■ ■ , X N) Y[fa+1 ~ Xif^ **> l d Xl ■ ■ ■ dx N . 

We give a concrete proof of the existence of Qq , which does not require the Kolmogorov extension 
theorem, as used by Von Renesse and Sturm. Then, we discuss why one might expect that 
(Go,d L2 ,Qq) would have generalized Ric > 0, or at least bounded below by some real number, 
including the work by Ddring and Stannat in [12], showing that (Go, d L2 , Qq) admits log-Sobolev 
and Poincare inequalities. However, we go on to prove that this space does not have generalized 
Ric > K for any real K. We show this by using a generalization of the proof of log-concavity 
of displacement convex measures on a Hilbert space (cf . [2] , Chapter 9) . Finally, we discuss the 
possibilities of a measure M £ V(Qq) having generalized Ricci lower bounds, suggest methods 
for constructing such an M, as well as necessary conditions that could be used to rule out such 
a property for a given M, and give a brief overview of the theory of optimal transport in Hilbert 
spaces. 

We also include Appendix [XJ in which we discuss nonsmooth changes of variables, and for- 
mulas for the density of nonsmooth pushforward measures, results which we make use of in 
various places throughout the text, particularly in Section El In Appendix iBl we give a very 
brief introduction to Riemannian manifolds and curvature. Then in Appendix[Cl we give a brief 
overview of the theory of manifolds with lower Ricci bounds. 

A few words on notation and terminology are in order. We denote the space of probability 
measures on a topological space X by 'P(X), and for a map / : X —}Y will write the pushforward 
of a (probability) measure p G V(X) by f*p £ V(Y), which is the measure such that for A C Y, 
measurable, f*p(A) = p(f~ 1 (A)). For a measure m on X, we will write < m if p{A) = 
for all measurable sets with m(A) = 0. If this holds, then we will also say that p is absolutely 
continuous with respect to m. It is then standard by the Radon-Nikodym theorem that there is 
a measurable function p on X such that p = pm. We will also make frequent use of Prokhorov's 
theorem, which in the case we need simply says that if X is a compact space then so is V(X) 
with the weak* topology (for a proof of a slightly more general version, cf. [H Theorem 6.5]). 
We further remind the reader that convergence p^ — > p in the weak* (also known as weak and 
narrow convergence) topology on X means that J x fp^ — > J x fp for all continuous functions 
/ on X. We write C(X) for all continuous functions, and recall that by one version of the 
Riesz representation theorem, we can identify measures on X with positive linear functionals 
on C(X)E 

We must emphasize that the majority of the material below is not in any way original work 
of the author, and should be considered as an expository work. We have tried to present the 
material in the Lott-Villani paper [26J in a simplified form. In particular, we have only discussed 
what they call oo-Ricci curvature, and have not discussed convexity of general functionals on 
V(X), just Ent(-|m). We have also made use of the Sturm papers [Ml El]) which present much 
of the same information, but take a slightly different viewpoint, and have the advantage (from 
the point of view of our exposition) that the first paper only considers "oo-Ricci curvature," 
which is what we have focused on. We have expanded on the background material on optimal 
transport given in the Lott-Villani paper, and following Villani's two books on optimal transport, 
[361 [37] have given (mostly) self contained proofs of the basic facts about the theory. For metric 
geometry, we have mainly relied on Gromov's book [TTJ and Burago-Burago-Ivanov's textbook 
[5]. The reader looking for further information about any of the material presented within will 
find the above references excellent places to start, and should hopefully be aided by the in-text 
citations as well. 

Finally, we remark on the parts of the essay which we believe to be original. We believe 
our proof (but certainly not the result) of Proposition 14.11 showing that (Qo,d L ) and (Vo,d w ) 
are isometric is original. We have presented it this form, which relies on knowledge of optimal 



N.B. in our statements of the Radon-Nikodym theorem, Prokhorov's theorem and the Riesz representation 
theorem, we have always been assuming that X is a compact metric space and we will always only consider Borel 
measures. 
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transport on compact Riemannian manifolds, rather than giving a more concrete proof, because 
we wanted to avoid discussing optimality conditions for transport plans. Secondly, the Gromov- 
Hausdorff approximations of (Go, d L ) in Proposition 17.61 are probably original, but we note that 
similar ideas are examined by Andres and Von Renesse in [3] in the context of Mosco- and T- 
convergence of Dirichlet forms. It seems that our proof (but not the result) of the existence of 
the entropic measure, Qq without relying on the Kolmogorov extension theorem, in Proposition 
18.31 is original. We believe that the approximation of Qq by measures on finite dimensional 
spaces in the sense of (measured) Gromov-Hausdorff convergence has not been studied in this 
exact form before (cf. [3] for related ideas, however). Finally, we believe that the results of 
Theorem 18.101 which says that (Go,d L2 , Qq) does not have generalized Ricci lower bounds has 
not been shown before. 

2. Geodesic Metric Spaces 
For (X, d) a metric space and j : [a, b] — > X a continuous curve, we define the length of 7 to 

be 

N 

(2.1) L(j)= sup £d(7(ti).7&fl)). 

a=t <ti<---<t N+1 =b i=Q 

Certainly this does not have to be finite. Curves 7 with £(7) < 00 are known as rectifiable 
curves. As an example, recall that if (X, d) is the metric space associated to a Riemannian 
manifold, then (|2. 1[) is the same as the usual definition of length for smooth curves, so in this 
case there are clearly plenty of rectifiable curves. Notice that by the triangle inequality applied 
to each term in the supremum in (|2.ip . we have that 

(2.2) d( 7 (a), 7 (b))<L( 7 ). 

We say that (X, d) is a length space if we can measure the distance between two points from 
knowledge of the lengths of curves between them, i.e. for p,q G X 

d(p,q) = inf L(>y) 

7 

where the infimum is taken over all paths from p to q. We say that (X, d) is a geodesic space 
if the infimum is actually attained, that is for all p,q £ X there is a curve 7 from p to q such 
that L(j) = d(p,q). We will call such a curve a geodesic. It is important to note that our 
terminology slightly differs from the classical notion of a geodesic on a Riemannian manifold in 
that we require a geodesic to be globally length minimizing. 

For points p,q 6 (X, d) a metric space, we call r a midpoint of p and q if d(p, r) = d(q, r) = 
7}d(p,q). This provides an often convenient characterization of geodesic spaces. 

Lemma 2.1. A metric space (X,d) is a geodesic space if and only if midpoints exist. 

Proof. It is not hard to show that the map t 1— > £(7|[ ,t]) is a continuous function of t as long 
as 7 is rectifiable (see [5] Proposition 2.3.4), so if 7 from p to q is a geodesic, then there is t 
with L(7| [0)t] ) = iL(7) = \d(p,q). Thus, for this t, d(p,~/(t)) = d(j(t),q) = \d(p,q), so j(t) is 
a "midpoint" between p and q. 

Conversely supposing that midpoints exist, for points p, q, there is a midpoint X\j 2 between p 
and q. Continuing this, there are midpoints xiu of p and X\j 2 and x 3 / 4 of x x / 2 and q. Thus, by 
induction, there are points Xfc/ 2 " such that xq = p, x\ = q and x^k+i)/2 n + 1 is the midpoint of 
x k j 2 n and X(fc +1 )/ 2 "- Furthermore induction, it is clear that this gives a map 7 from the dyadic 
rationals to X such that 

d( 7 (A:/2 n ),7(fc , /2 n ')) = \k/2 n - k'/2 n '\d(p,q) 



and because of this, 7 extends by continuity to a continuous map 7 : [0, 1] — > X, with L(-y) = 
d(p,q). ' □ 
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Lemma 2.2. Any geodesic 7 : [a, b] — > X can be continuously reparametrized so that it is a 
map 7 : [0, 1] — > X and 

d{ 1 {t), 1 {t')) = \t-t'\d{ 1 {Q), 1 {l)). 

Further details and a proof of this can be found in Proposition 2.5.9]. From now on, we 
will assume that all geodesies are so parametrized. 

We will often use the term induced length metric or intrinsic length metric in the following 
situation. Suppose that we have a compact geodesic space (X, d) and a (path connected) closed 
subset Y C X. We can define a metric d on Y by 

d(y,y') = inf L(j), 

■yCY 

where the infimum is taken over all paths 7 from y to y' which are contained entirely inside Y 
but L("f) is the length as measured in X with the metric d. It is possible to show that under 
our assumptions on X and Y, (Y,d) is a geodesic space; cf. [5j Chapter 2]. As such, we call d 
the intrinsic/induced metric. 

3. Optimal Transport and Wasserstein Distance 

Let (X, d) be a compact geodesic space. We will denote the space of probability measures on 
X by V(X). For k = 1,2, we define proj fc : X x X — > X to be projection onto the k-th factor. 
For two probability measures fi, v G V(X), we define the set of admissible transport plans to be 

(3.1) v) = {ir G V(X x X) : (proj^vr = fi, (proj 2 )*vr = v) 

where for k = 1, 2, (proj fc )*7r G V(X) is the k-th marginal, (proj fc )*(j4) = 7r[(proj^ (A)]. This 
is certainly nonempty, because the measure /U (g) z> which is defined by 

f(x,y)dfi®i/(x,y) := / / f{x,y)dn{x)dv{y) 

XxX JXJX 

for continuous / € C(X x X), is clearly in H([i,v). Loosely speaking, a transport plan is a 
proposal for how to move the mass of \i around so as to assemble the distribution of mass 
prescribed by v. Given the above definition, we define the 2- Wasserstein distance between \i 
and v to be 

(3.2) d W {n,vf:= inf / d(x,y) 2 dir(x,y). 

Treu(p,u) J XxX 



Lemma 3.1. The infimum in (|3.2p is always achieved. 
Proof. Letting ir^ £ n(/i, v) be a minimizing sequence, 

lim / d(x,y) 2 d7Tk = d w \[i,vY 

k ^°°JxxX 



k- 

Because X x X is compact, by Prokhorov's theorem, extracting a subsequence, we can assume 
that there is some tt G V(X x X) such that tt^ — > it in the weak* topology. Because d(x,y) 2 is 
continuous on X x X, we thus have that 

d(x,y) 2 dir = d w (fi,u) 2 . 

XxX 

It remains to show that n G II(//, z/). For continuous functions f,g£ C(X) 



(f(x) + g(y))dn(x, y) = lim / (/(a;) + g(y))dir k (x, y) 

XxX k ^°°JXxX 

lim I / f(x)dfi(x) + / g{y)dv{y) 

k^oo \J X J x 

f(x)dfi(x) + / g{y)dv{y) 
x Jx 

Taking g zero, because / was arbitrary, this shows that (proj 1 )*7r = \x and similarly setting / 
to zero gives (proj 2 )*vr = v, so we have that ir G v). □ 
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We call such a minimizer an optimal transport plan. We remark that for a fixed fj,,u G V(X) 
it could be true that there is more than one optimal transport plan. 

3.f . Metric and Topological Properties of Wasserstein Distance. In the following proof, 
we give a simplified version of the exposition in |36l Chapter 7]. 

Proposition 3.2. The Wasserstein distance, d w , defined by (|3.2p is a metric on V(X). 

Proof. Clearly d w is symmetric, nonnegative and finite on V(X). Furthermore, d w (/x, n) = 
for all \x G V(X). If d w (fj,, v) = 0, then by Lemma [3. H there exists an optimal plan it G II(^, v) 
with 

/ d(x,y) 2 dir(x,y) = 0. 

JXxX 

This shows that rr must be concentrated on the diagonal Ax C X x X, and thus is invariant 
under x o y. Thus, for u G C(X) 

u(x)dfj,(x)= / u(x)dn(x,y) = / u(y)dir(x, y) = / u{y)dv{y). 

X J XxX JXxX JX 

which shows thate [i = v. Thus, it remains to prove the triangle inequality. This requires 

Lemma 3.3 (Gluing Lemma). For probability measures /^i,^2>M3 £ V(X) and associated trans- 
port plans 7Ti2 G n(/ii, ^2)1 7T23 £ n(^2 5 ^3), there exists a probability measure tt G V(X x X x X) 
with marginal tt\2 on the first two factors and 7^3 on the second two factors. 

We will prove this after we show how it establishes the triangle inequality. For ^1,^2,^3 G 
V(X), let 7Ti2 G IT(^i,/i2) and 7^3 G II(/X2,/i3) be optimal transport plans. The gluing lemma 
says that there is tt G V{X x X x X) with marginals 7Ti2 and 7^3. We let 7i"i3 be the marginal 
on the first and third factors of X. It is clear that this is a transport plan between ji\ and fi 3 
(it is not necessarily optimal), giving 

d w (m,fi 3 ) 2 < / d(xi,x 3 ) 2 d7Ti 3 (xi,x 3 ) 

J XxX 

d{x 1 ,x 3 ) 2 dir{xi,X2,x 3 ) 

XxXxX 

< I [d(xi,x 2 ) + d(x 2 ,x 3 )] 2 d7r(xi,x 2 ,x 3 ) 



XxXxX 



[d(xi, X2) 2 + d(x 2 ,x 3 ) 2 + 2d(xi,x 2 )d(x2,x 3 )] d7r(xi, x 2 , x 3 ) 

XxXxX 

d(xi,x 2 ) 2 diri2(xi,X2) + / d(x 2 ,X 3 ) 2 (i7r23(x2,X3) 
XxX JXxX 

+ 2/ d(xi,X 2 )(i(x2,X3)(i7r(xi,X2,X3) 
JXxXxX 

= d W (jil, fl 2 ) 2 + d W (n2, /J.3) 2 + 2 / d(xi,X2)d(x2,X3)d7r(xi,X2,X 3 ) 

JXxXxX 

< 2 f / d(xi,x 2 ) 2 (i7r(xi,X2,X3) N ) f / <i(x 2 , x 3 ) 2 (i7r(xi, x 2 , x 3 )\ 

\JXxXxX / \JXxXxX / 

+ d W ( f l l ,fl 2 ) 2 +d W ( f l2^3) 2 

= d w (fJLi, M2) 2 + 2d w (m, fi 2 )d w (H2, ^ 3 ) + d(^2,^3) 2 

= [d W ^i^2) + d w {^^)} 2 
proving the triangle inequality. Thus, it remains to prove the gluing lemma (Lemma I3.3p . 
Proof of the Gluing Lemma. Let V C C(X x X x X) be the vector subspace 
V := {^I2(xi, x 2 ) + ^23(x2, x 3 ) : <pi 2 , ^23 G C(X x X)} 



s 
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X 



and define a functional G : V — > K by 

G(ipu + <P23) ■= / <Pl 2 (xi,X 2 )dTT 12 (xx,X 2 ) + / ^23(x2,X 3 )(i7r23(x2,X3). 

JXxX JXxX 

It is not a priori clear that this is well defined. If ip\2 + 9223 = ¥>12 + ^23 as functions, then 

V12(X1,X 2 ) - <P12(X1,X 2 ) = <P23(X2,X 3 ) - if23(x2,X 3 ), 

which shows that both sides are functions of X2 only. Thus 

[<P1Z(XI,X 2 ) ~ <Pl2(xi,X 2 )}dlTi2(x 1 ,X2) = / [<Pl 2 (xi,X 2 ) ~ <fl2 {x\ , X 2 )]dfX 2 (x 2 ) 
XxX JX 

.<P23(X 2 ,X 3 ) - ip23(x2,X3)]dfl2(x 2 ) 

[<f23(.X 2 ,X 3 ) - (p23(x2,X 3 )]dTT23(x2,X 3 ) 

XxX 

and rearranging the terms shows that 

G(ipi2 + ¥23) = G(<p 12 + <p 23 ) 

so V is in fact well defined. Furthermore, G is clearly bounded and linear. Thus Hahn-Banach 
tells us that there exists an extension to a positive functional G : C(X x X x X) — > R, and by 
the Riesz representation theorem, G is represented by some (Borel) measure ir G V{X x X x X). 
Because 

[</?12(xi, X 2 ) + ip23(x2,X 3 )]dTT(xi,X2 1 X 3 ) = G(if 12 + ip23) 

XxXxX 

= G(lfi2 + ¥23) 

^Pi2(xi, x 2 )dn 12 (xi,x 2 ) 

XxX 



+ / V23(x2,X 3 )dlT2 3 (x2,X 3 ) 
J XxX 

it is clear that tt has marginals ir\2 and ^23, as desired. □ 

This completes the proof that d w satisfies the triangle inequality, and thus the proof that it 
is a metric on V(X). □ 

The next proof is a simplification of the proof of |37[ Theorem 6.9], and we will rely on a 
result characterizing the support of optimal plans known as c-monotonicity, references for which 
we give below. 

Proposition 3.4. Equipped with the Wasserstein distance, (V(X),d w ) is a compact metric 
space with the same topology as that on V{X) defined by weak* convergence. 

Proof. It is enough to show that fik — > \i in the weak* topology if and only if we have that 
d w \nki A 4 ) ~~ ► 0, because then we have compactness by Prokhorov's theorem. To begin with, 
suppose that we have the convergence — > fi in the weak* topology. Let irk £ n(/Xfc, yu) be 
optimal transport plans. By compactness of V[X x X), we may extract a subsequence ity such 
that 7Tfc/ — > tt in the weak* sense. First, we claim that tt is a transport plan, i.e. tt S II(//,yu). 
To see this, notice that 



<p(x) + ip(y)]dn(x, y) = lim / [<p(x) + ip(y)]dTT k > (x, y) 
XxX k ^°°JxxX 

lim / ip(x)dn k >(x) +/ i/)(y)d[i(y) 



k'— ¥00 



x JX 



The usual version of Hahn-Banach does not tell us that the extension need be positive. The reader could 
try to modify the proof of Hahn-Banach to show this (some caution must be taken here: the fact that there is 
always an element in V dominating any element in C(X x X x X) (e.g. a constant function) should be used in 
the proof). Alternatively, cf. [U Section 8.11]. 
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Lp(x)dfi(x) + / ip{y)d^i{y), 

X JX 

showing that it £ II(//, //). We now rely on |37} Theorem 5.20], which says that the weak* limit 
of optimal plans is optimal (which follows from the discussion of c-monotonicity of supports of 
optimal plans, which can be found in various places, including the cited theorem, or [2] Theorem 
6.1.4]). Thus, 

lim d w (fi k i , fi) 2 = lim / d(x, y) 2 dir k i(x, y) = \ d(x, y) 2 dir(x, y) = 0, 

k '^°° fc '^°°iXxI JXxX 

where the last equality follows from the fact that it £ II(//, //) is optimal, so its cost must be 
zero. 

Conversely, suppose that d w (fJ, k , A*) — > 0. Because V(X) is compact, extracting a subsequence 
there is some jl € V(X) with fj, k i — > \x in the weak* topology. The above part of the proof shows 
that d w (fi k/ ,fl) ->• 0. By uniqueness of limits in metric spaces, this shows that /i = fi, and 
because we can repeat this argument for any subsequence, we have that [i k — > /i in the weak* 
topology. □ 

3.2. Dynamical Transport Plans and Geodesies. From now on, we will assume that (X, d) 
is a (compact) geodesic space. We define Lip([0, 1]; X) to be Lipschitz continuous maps from 
[0, 1] — >• X with the uniform topology, and let 

rcLip([0,l];X) 

be the (compact by Arzela-Ascoli) subset of unit speed parametrized geodesies. For t S [0, 1], 
the evaluation maps et : T — > X, given by et : 7 1— > "f(t) are clearly continuous. We call a 
Borcl measure on T a dynamical transport plan between fj,,v G V(X) if (eo, ei)*G G n(/x, v). 
Intuitively, O is a choice of which geodesic to transport the mass along, instead of just a initial 
and final endpoint. If (eo,ei)*0 is an optimal transport plan then, reasonably enough, we call 
an optimal dynamical transport plan. Again, even for a fixed optimal transport plan ir, there 
might be multiple dynamic plans with (eo, ei)*6 = it. For example, if 

(3.3) X = ([-2, -1] x {0}) U {{x, y) : x 2 + y 2 = 1} U ([1, 2] x {0}) C R 2 

is equipped with the "induced length metric," (i.e. the distance between two points is the 
minimum of the lengths of curves between the two points, where length is measured in the 
usual way for a curve in M 2 ), and [i = Sr 2 ,o) an d v = ^(2,0) > ^ ^ s clear that it = <5(_2,o)x(2,o) 
is the unique optimal transport plan, but there are two geodesies between (—2,0) and (2,0), 
say 71 which goes along the upper hemisphere of the circle and 72 which goes along the lower 
hemisphere, as in Figure [TJ Then, for any s £ [0, 1], defining 

Q s := se) 7l + (1 - s)J 72 

we clearly have (eo,ei)*0 s = it. 




Figure 1. An example of a unique optimal transport plan on the space X 
defined in (j3.3j) (transporting all of the mass at (—2,0) to (2,0) is the only 
transport plan, and is thus the optimal one) with multiple associated dynamical 
transport plans. 
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Proposition 3.5. Assuming, as above, that (X,d) is a compact geodesic space, for any fj,,u G 
V(X), and any optimal transport plan tt, there exists an optimal dynamical transport plan 
between fi and v, Q with (eo,ei)*G = tt. 

Proof. If (eo,ei) admits a Borel right inverse S : X x X — > V with 

(e ,ei) oS = Idxxx, 
then we are finished, because this implies that 

TT = Id*7T = (eo,ei)*5*7r 

so we can define G = S*ir. The map (eo,ei) : V — > X x X is certainly surjective, as X is a 
geodesic space, so a right inverse exists by the axiom of choice. However, there is no reason 
that it should be measurable, and we must turn to a technical result known as a measurable 
selection theorem, one version of which says that because XxX and T are separable, compact, 
and metrizable, there is a Borel measurable right inverse to (eo, ei). In [A0\ Corollary A. 6], this 
version of measurable selection is shown to follow from |18t Proposition 3.1]. □ 

Corollary 3.6. If(X,d) is a geodesic space, then so is (V(X),d w ). 

Proof. For fi, v G V(X), by Lemma l3.1| there exists an optimal transport plan tt G Tl{fi,v), 
and by Proposition 13.51 there is a corresponding optimal dynamical transport plan G. We claim 
that fit ■= (et)*0, where : T — > X, 7 \-t j(t) is a geodesic between fi = fiQ and v = To see 
this, we calculate 

d w (fj, t , fJ>t>) 2 < / d(x,y) 2 d[(e t ,e t >)*Q](x,y) 

J XxX 

= J^(t)Mt')) 2 dQ( 7 ) 

= |t-t'| 2 ^d(7(0),7(l)) 2 dG( 7 ) 

= \t-t'\ 2 [ d(x,y) 2 (i[(eo,ei)*Q](x,y) 

J XxX 

= \t- t'\ 2 I d(x,y)dir(x,y) 

J XxX 

= \t-t'\ 2 d w (^, l l 1 ) 2 . 

This implies that fit is continuous and moreover that L{fi t ) < d {fj,Q, fi\). Thus by (|2.2p . 
L(fit) = d w (fio, fii), so fit is a geodesic between fi and z/, because we must have equality in the 
first line above. □ 

Proposition 3.7. The map i : X — > V(X) defined by x h-> 5 x is an isometric embedding. 

Proof. For x,y £ X, notice that H(8 x ,5 y ) = {Sr x ,y)}, so 

d w (i(x),i(y)) 2 = / d(x,y) 2 5( x>y) (x,y) = d(x,y) 2 , 

J XxX 

showing that i is an isometric embedding, as claimed. □ 

Notice that i(X) need not be a totally geodesic subset of V(X). Certainly for x,y G X, if 
j(t) is a geodesic between x and y, it is easy to see that 8^u\ is a geodesic between S x and ^ 
that lies entirely in i(X). However, if there is more than one geodesic between x and y, an 
(infinite) convex combination of these geodesies will also be a geodesic, which is disjoint from 
i(X) except for at its endpoints. For example, on (S 2 ,dg2), if we take x to be the north pole 
and y the south pole, then clearly if ^(t) : [0, 1] — > S 2 is the constant speed geodesic along any 
great circle from x to y then S^m is a geodesic in V{S 2 ) from 5 X to 5 y and lies entirely in i(S 2 ), 
but for example in coordinates {<p, 9) on the sphere, where <f> is the angle from the z-axis and 
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9 is the azimuthal angle of the projection of the point onto the xy-plane in polar coordinates, 
then if we define 

A t :={(7rt,9):ee [0,2vr)} 

to be the circle at <p = nt, then if we write "H 1 for the 1-dimensional Hausdorff measure on S 2 , 
it is not hard to see that 

'(27rsinvrt)- 1 'H 1 U t for t € (0, 1) 
H '■= { S x for t = 

5 y for t = 1 

is a geodesic between S x and 5 y in V(S 2 ) which is most certainly not in i(S 2 ). This is illustrated 
in Figure [21 




Figure 2. This figure shows that the image of X in V(X) under the map 
i '. x i — y Sx is not totally geodesic. There is a geodesic between 5 X and 8 y which 
at time t is supported on A t , as described in the text, and is thus clearly not in 
i{X). 



4. Metric Structure of (V ,d w ) 

We now specialize to studying ('PQO, l\),d w ), which will often write (Vo,d w ). 

Proposition 4.1. Letting Qq C L 2 ([0, 1]) be the subset of right continuous, nondecreasing maps 
g : [0, 1] — > [0, 1], and d L2 the metric induced on Qq from L 2 , then the map 

(4.1) v:(g ,d L2 )^(r ,d w ) 

g i — y <7*Leb 

is an isometry. The inverse is given by 

(4.2) VP -1 : /j, i-> 9ll 
where g^ is the inverse distribution function defined 

(4.3) ^(t):=inf{fl€[0,l]:M([0,fl])>t} 
with the convention that inf := 1. 

Proof. It is clear that ^ as defined is a bijection with stated inverse. Thus, it remains to show 
that it is an isometry. For f,g£ Go, notice that (/, <7)*Leb[ 0j i] G U(^(f), ^(g)), and that 

rl 

\2, 



\x-y\ z d[(f,g)*Leb}(x,y)= / \f(s)-g(s)\ 2 dLeb(s). 

[0,1] x [0,1] 
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Thus 



(*(f),*(g))<d L2 (f,g). 



There are various ways to show the opposite inequality. One way is to use the Knott-Smith 
optimality condition, which says that a plan ir between two measures on R n is optimal if and only 
if there is a convex, lower semi-continuous function ip : M n — > K with supp(7r) C Graph (9<£>) 
(where dcp is the sub differential of ip). This approach is taken in the proof of [36, Theorem 
2.18](which is stated for V(M), but the same proof works without change for Vq). Alternatively, 
a more probabilistic proof, using the Hoeffding-Frechet theorem which relates the cumulative 
distribution function of a measure on M 2 to the cumulative distributions of its marginals, is 
given in [31) . However, in order to avoid distracting ourselves with these notions, we will rely 
on Theorem [63 All that we will need is that on a compact manifold (M, g), if /j,, v E V(M) with 
\i,v <C m = (vo1(M)) _1 vo1m then there is an optimal transport plan tt of the form (Id,-F)*// 
for some measurable F : M — > M. Such a plan is called a Monge transport plan. Furthermore, 
we use that F(x) = exp x (—X7(f)) for a ^--concave function <j), but all we will really need is the 
special case of the circle M = S , where this implies that F is "nondecreasing," in the sense 
that it lifts to a nondecreasing function M — > K. 

Given this fact, we will finish the proof by isometrically embedding [0,1] into (S 1 ,^), the 
standard circle of circumference 3, R/3Z. For /, g £ Qq with /, g continuous and strictly 
increasing, defining \x = ^(/), v = ^(g) 6 Vq, it is clear that our assumptions imply that \i, v <C 
Leb and supp(/i) = supp(i^) = [0,1]. Furthermore, they give rise to measures fl, v G V(S 1 ) 
supported on the image of [0, 1], and that <C m, so by Theorem 16.71 there is an optimal 
Monge plan tt = (Id, F)*fi between jl and v. This implies that F*/2 = v, so F(supp(/i)) C 
supp(P), so defining a new function F : [0,1] — >• [0,1] by F := ^|[o,il) we claim that tt := 
(Id, F)*ijl, this is an optimal Monge transport plan. It is clearly a transport plan, and if it were 
not optimal, by considering [0, 1] x [0, 1] as a subset of S 1 x S 1 we could use an optimal plan 
to contradict 7r's optimality. Because F is nondecreasing, it is clear that F is nondecreasing. 
Furthermore, because v <C Leb, we may assume without loss of generality that F is right 
continuous, because this is only a modification on a set of //-measure zero. 

Now, combining all of this, we see that g*Leb = i^/^Leb = (F o /)*Leb, and because of our 
above observations, F o / g Q , so bijectivity of \I/ gives that g = F o f. Thus, we see that 



To finish the proof, it remains to notice that continuous, strictly increasing functions are dense 
in (Qo,d L ). To show this, for example, for an arbitrary h G Qq, define h n to be the linear 
interpolation between the points {{i/n, /(i/ n )}f=0' anc ^ n °tice that 



tt = (Id, F)+n = (Id, F)*/*Leb = (/, F o 



/)*Leb = (/,sO*Leb 



is optimal, so for f,g £ Go continuous and strictly increasing 



(4.4) 



(*(f),*(g))=d L \f,g). 






Mo)) 



i 



< -. 

n 
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We're not quite finished; h n is certainly nondecreasing, but might not be strictly increasing. To 
fix this, define 

h n (x) := -h n (x) + - 

n n 

and notice that 

\\K - K\\ L 2 < - (\\hn\\ L 2 + \\x\\ L 2^j -)■ 

so, we have that h n is a sequence of continuous, strictly increasing functions converging to h in 
Go, finishing the proof. □ 

We must remark that as far as we know, Lemma 14. 1 1 is highly dependent on the one dimen- 
sionality of [0, 1]. A similar statement holds for the circle S , but it is not clear how to embed 
V(X) into a Hilbert space in some reasonable manner. In fact, as we will see in the next lemma, 
the image of the embedding is totally convex, allowing us to give a very concrete description of 
the geodesies in TV 

Lemma 4.2. The space Go is a totally convex subset of L 2 ([0, 1]) (that is, any geodesic between 
two elements in Go ^es entirely in Go)- In fact, for f,g£ Go, the unique geodesic between them 
is given by the linear combination 

7 (i) := (l-t)f + tg. 

Proof. For /, g 6 I? , we claim that the convex combination j(t) = tf + (1 — t)g is a L 2 -geodesic 
from / to g. Notice that 

\h(t) - l(s)\\ L 2 = \\(t- s)f + (s- t)g\\ L 2 = \t- - g\\ L2 

so it is thus a geodesic. Now we claim that these are the unique geodesies. Assume that for 
/ 7^ g, there is a h £ I? which is not in 7QO, 1]). This gives 

(11/ " h\\ L 2 + \\h - g\\ L 2) 2 = \\f\\ 2 L2 + \\g\\ 2 L2 + 2\\h\\ 2 L2 -2(f,h)-2 (g, h) 

+ 2\\f -h\\ L 2\\g-h\\ L 2 
> WfWh + \\9\\h + 2 \\Hh -2(f + g,h) + 2(f-h,h-g) 
\ 2 L 2 + \\g\\ 2 L 2 + 2\\h\\\ 2 -2(f + g,h)-2(f,g) + 2 (h, g) 
+ 2{f,h)-2\\h\\ 2 L2 
\h + \\gf L 2 -2(f + g,h)-2(f,g) + 2 (h, g) + 2 (/, h) 
\h + \\9\\h-2(f,g) 
= 11/-* 

with equality in the application of the Cauchy-Schwartz inequality if and only if there is A G R 
with f — h = X(h — g) (because neither term can be zero). This implies that (A + l)h = / + \g. 
Thus if A 7^ —1, this implies that 

Furthermore, we cannot have that A = —1, because that would imply that / = g. Thus, we see 
that for any h 7QO, 1]) a path from f to g going through h is strictly longer than 7, so we see 
that geodesies in L 2 are unique, as claimed, which clearly implies that Go is a totally convex 
subset of L 2 , with the claimed geodesies. □ 

5. Metric Sectional Curvature 

5.1. Toponogov's Theorem and Alexandrov Curvature. One of the oldest geometric 
notions which has been generalized to geodesic metric spaces is that of sectional curvature 
bounds. Because it is relatively simple (and we hope instructive) we sketch the relevant ideas in 
this section, and then discuss the relationship of sectional curvature of a Riemannian manifold 
(M,g) with the geometry of {V{M),d w ). 
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Suppose that (M, g) is a complete Riemannian manifold (we give a brief introduction to 
Riemannian geometry, along with many references in Appendix [B]) with nonnegative sectional 
curvature, which we write K{M) > 0. We will see that we can characterize non-negative 
curvature, K(M) > 0, by comparing the shape of "geodesic triangles" in M to appropriate ones 
in euclidean space, and this will turn out to be a condition that is only given in terms of the 
distance metric on M and not the Riemannian metric^ If p, q, r € M are three points, and 
7 P q, 7 gr , 7 pr are length minimizing geodesies between p and q, q and r and p and r, respectively, 
then we will call this a geodesic triangle. Clearly for any three points there is at least one such 
triangle with vertices at those points, but as the example of the sphere shows, there can be 
more than one such geodesic triangle with vertices at a give triple of points. In spite of this 
ambiguity, we will usually denote such a geodesic triangle by Apqr. 




Figure 3. A geodesic triangle and corresponding comparison triangle. 

Given a geodesic triangle Apqr in M, we consider a comparison triangle in R 2 , which is a set 
of points p,q,r of the same distance apart, i.e. 

|Ml = d M (p,q) = Li'jpg), \pr\ = d M (p,r) = L(^ pr ) and \qr\ = d M {q,r) = L{^ qr ). 

We will denote this comparison triangle by Apqr. These definitions are illustrated in Figure 
[31 By the triangle inequality for oIm, such a triangle always exists and from basic euclidean 
geometry such a triangle is unique up to rigid isometry once the side lengths are fixed. Now, 
let us consider the example of a standard sphere, (S 2 ,d S 2). Here, as illustrated in Figure HI 
a geodesic triangle is "fatter" than the corresponding comparison triangle. By this, we mean 




Figure 4. A spherical geodesic triangle. 

that, for example a point on j qr is farther (as measured by d^) from p than the corresponding 
point on qf in IR 2 . 

It turns out that this "fatness" property holds in all nonnegatively curved manifolds, as 
proved by Alexandrov for surfaces, and Toponogov for general manifolds 

''in fact, for a general K £ R, a similar "metric" version of K(M) > K and K(M) < K can be given by 
comparing to the simply connected spaces of constant curvature A' in a similar fashion to that described in this 
section. This more general Toponogov theorem is described in many places, for example in [25J Chapter 11] or 
Chapter 2]. The resulting condition on metric spaces leads to the notion of Alexandrov spaces in the case of 
K(M) > K and Hadamard spaces for K(M) < K. Further details can be found in [5]- 
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Theorem 5.1 (Toponogov). Given a complete Riemannian manifold (M,g) with K{M) > ; 
and a geodesic triangle Apqr C M, let Apqr Cl 2 be a corresponding comparison triangle. For 
a point x £ ^ qr , there clearly is a unique x £~qr with \qx\ = dM(q,x) and \xr\ = djv/(x,r). For 
this x, we have that 

(5.1) d M (j>,x) > \px\. 

If (M, g) is an arbitrary Riemannian manifold such that every geodesic triangle Apqr satisfies 
the above comparison inequality, then K(M) > 0. 

See Figure [5] for an illustration of such a triangle. We do not give proof, as it would take us 
too far astray (a proof can be found, for example, in [291 Chapter 11]). Notice that Toponogov's 
Theorem tells us that if we know the (metric) distance between any four points of M, we can tell 
whether or not K{M) > (in fact, all we really need is the corresponding distances for any three 
points p, q, r and a fourth x lying between q and r in the sense that d(q, x) + d(x, r) = d(q, r)). 
Thus, we have a characterization of a manifold (M,g) having K(M) > purely in terms of the 
length metric g?m! 




Figure 5. Comparing the lengths d]w(p, x) and \px\ as in Toponogov's Theorem. 
Here diwip, x ) — \px\i so this triangle satisfies the conclusion of the theorem. 



As indicated above, this allows us to generalize K{M) > to some metric spaces. If (X, d) 
is a geodesic space it is clear that the notion of geodesic triangles and comparison triangles still 
makes perfect sense, and we can turn Theorem 15.11 into a definition, generalizing the notion 
of nonnegative sectional curvature in such a way that the new definition is equivalent to the 
standard one when (X, d) is secretly a Riemannian manifold in disguise as a metric space. As 
such, we will say that a metric space (X, d) has nonnegative curvature (in the Alexandrov sense) 
in this case. 

5.2. Alexandrov Sectional Curvature of Wasserstein Spaces. It turns out that the sec- 
tional curvature of the base space is related to the Alexandrov curvature of the associated 
Wasserstein space 

Theorem 5.2. For (M,g) a smooth compact connected Riemannian manifold, M has non- 
negative sectional curvature if and only if V(M) has nonnegative curvature in the Alexandrov 
sensed 

See, for example [26} Theorem A. 8] or [33} Proposition 2.10]. The following is the same as 
[19\ Proposition 4.1], except that the given proposition is for the interval, instead of the real 
line. 

Proposition 5.3. The space (Po,d w ) has "vanishing Alexandrov cuvature" in the sense that 
equality always holds in f)5. 1 1) when comparing a geodesic triangle to a comparison triangle. 



It is important to note that even though many of the results stated in this essay are simplified (for example, 
we could easily generalize Toponogov's theorem to sectional curvature bounded from below by A" € K), the 
reasonable looking generalization of Theorem 15.21 to say that if M has sectional curvature bounded below by 
k then CP(M),d w ) has curvature bounded below by k in the Alexandrov sense is false. See, for example, [33 1 
Proposition 2.10]. 
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Proof. Vanishing Alexandrov curvature is equivalent to the following property: for [/,, u, r) G Vq, 
denoting by 7 : [0, 1] — > Vq the unique geodesic between v and 77, we have that 

(5.2) d w {^{t)f = (1 - t)d w (^ 7 (0)) 2 + td w {^{l)) 2 " *(1 " t)^(7(0),7(l)) 2 - 

To see this, we take a comparison triangle Apqr as in Figure EJ We choose the triangle so that 
p corresponds to /j, q corresponds to v, and r corresponds to r\. As in Figure El we label the 
angle ZJpqr by 9. If we choose coordinates so that q = (0,0), r = (\qr\,0), then we can take 



V 




Figure 6. Example comparison triangle Apqr for the calculation of \px\ in order 
to show that (|5.2p is equivalent to \px\ being equal to the corresponding length 
in V . 

P = (\pq\ cos9, \pq\ sin#). Then, x = (t\7jf\,0), so 

\px\ 2 = (\pq\ cos9 — t\qf\) 2 + (\pq\ sin^) 2 = \pq\ 2 + t 2 \qf\ 2 — 2t\pq\\qf\ cos9. 

However, by the law of cosines, we have that 

2|pg||gr| cos 9 = — \pf \ 2 + \pq\ 2 + \qf\ 2 , 

so combining these two gives 

\px\ 2 = (1 - t)\pq\ 2 + t\qf\ 2 - t(l - t)\pr\ 2 - 

Because Apqr is a comparison triangle, it is immediate that \px\ = d w (^L,"f(t)) if and only if 
(1BT21) holds. 

To show that (|5.2p holds, we will show the equivalent identity in Qq, which follows easily from 
the fact that L 2 ([0,1]) is a Hilbert space. For f,g,h £ Go, by Lemma 14.21 ^v(t) := (1 — t)g + th 
is the unique geodesic between g and h. Thus, it is equivalent to show that 

||/ - 7(t)||| 2 = (1 - t)\\f - g\\ 2 L2 +t\\f- h\\% - t(l -t)\\g- h\\%. 

To do this, we compute 

(1 - t)\\f - g\\ 2 L2 + t\\f - h\\ 2 L2 - t(l - t)\\g - h\\\ 2 

= \\f\\\ 2 - 2(1 - t) (/, g) L2 + (1 - t) 2 \\g\\ 2 L2 + t 2 \\h\\ 2 L2 - 2t (/, h)„ - 2t(l - t) (g, h) L , 

= ||/ - (1 - t)gf L2 - 2t (/ + (1 - t)h, h) L2 + t 2 \\h\\ 2 L2 

= \\f-(l-t)g-th\\ 2 L2 

= \\f-l(t)\\h, 

showing that (Vo,d w ) has vanishing Alexandrov curvature, as desired. □ 

6. Metric Ricci Curvature 

Now that we have seen the notion of Alexandrov curvature, we turn to a metric notion 
of Ricci curvature. It will turn out that convexity of certain functionals along geodesies in 
(V(X),d w ) provides a good generalization of lower bounds of Ricci curvature. To do this, we 
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fix a reference Borel probability measure m £ V(X), and we will consider the geodesic measure 
triple (X, d, m). We define the entropy functional! Ent(-|m) : V(X) ->K = lU {+00} by 



p i->- 



J x p log p dm for p <C m and p = pm 
+00 otherwise 



Intuitively, Ent(/i|m) measures the nonuniformity of p with respect to m. In some sense, we can 
think of it as a kind of distance (without symmetry or a triangle inequality) , which is "extensive" 
in the sense that if p, p' , m £ V(X) then Ent(// <S> p'\m ® m) = Ent(//|m) + Ent(//|m). 

Definition 6.1. We say that for the geodesic measure space (X,d,m), Ent(-|m) is weakly a.c. 
K -displacement convex if for any probability measures po,pi <C m, there exists a geodesic in 
V{X), pt from po to p\ so that Ent(^|m) is K-convex, in the sense that 

K 

(6.1) Ent(p t \m) < £EutOi|m) + (1 - i)Ent(/i |m) - —t(l - t)d w (p , pif . 

The "a.c." corresponds to requiring po,p± absolutely continuous with respect to m and the 
"weakly" corresponds to requiring (|6.ip to hold along only one geodesic, not all of them. By |26t 
Proposition 3.21], we could actually drop the a.c. requirement, but it will be more convenient 
to keep it. Furthermore, on Riemannian manifolds, it turns out (cf. |26t Lemma 3.25]) that the 
"weakly" condition is unnecessary, but for general geodesic measure spaces it will prove to be 
important when we consider stability under Gromov-Hausdorff convergence in the next section. 
One indication this is a good definition is given by 

Theorem 6.2. For a compact Riemannian manifold (M,g), regarding it as a geodesic measure 
space (M,d,m), with m = (vol(M)) vol/vf, we have that Ent(-|m) is weakly a.c. K-convex if 
and only if Ric > K on M|_| 

As such, we will often refer to a geodesic measure space (X, d, p) on which Ent(-|/i) is weakly 
a.c. .ftT-convex as a space with generalized Ric > K. 

6.1. Displacement convexity in Vq. Before proving the above theorem, we will first deal 
with our special case, M = [0, 1], which does not quite fit into the statement of Theorem 16.21 
because we will not discuss manifolds with boundaries. We could deduce Corollary 16.41 from 
Theorem 16.2} by considering [0, 1] as a small segment in 5 1 and then showing that a convex 
subset inherits Ricci bounds. However, it is more instructive to do things explicitly in this case, 
which is what we will do. 

Proposition 6.3. We have the following formula for the entropy functional on Vq. For f £ Qq, 
we have that 

(6.2) Ent(*(/)|Leb) = - / log f'(x)dx. 

Jo 

By Proposition 14.11 and Lemma 14.21 which relate geodesies in Vq to those in Qq (which are 
simply linear interpolation) this allows us show that 

Corollary 6.4. For po,pi £ Vq, letting pt be the unique geodesic between them, Ent(/ij|Leb) 
is a convex function (possibly taking the value +00). Thus, Ent(-|Leb) is a.c. displacement 
0-convex on [0, 1] . 

Proof of Proposition \6.3[ Because / is monotonically increasing, it is differentiable almost ev- 
erywhere. First of all, suppose that on some set of full measure, A C [0, 1], we have that /\a 
is injective and for all x £ A, f is differentiable at x with f'(x) > 0. In this case, we can see 
that by a nonsmooth change of variables (Lemma lA.l|) that _/*Leb is absolutely continuous with 
respect to Leb, and furthermore /*Leb = pLeb with 

p(f(x))f'(x) = 1 



''N.B., what we call "entropy" is actually Boltzmann's H-functional. 

^By this, we mean as a bilinear form, or in other words Ric > K if and only if Ric(£,£') > Kg(£,£') for all 
p e At and £, £' € T P M. See Appendix [B] 
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for a.e. x G [0, 1]. Thus, this gives that 

Ent(*(/)|Leb) = f p{y) log p(y)dy = [ p{f (x)) log p(f(x))f'(x)dx = - [ logf'(x)dx 
Jo Jo Jo 

as desired (the nonsmooth change of variables y = f(x) is justified by the area formula (jA.ip ). 

On the other hand, suppose that such an A does not exist. We will show that both sides 
of (|6.2|) are equal to +oo. If / is constant on some interval, it is not hard to see this, because 
implies that /*Leb is not absolutely continuous and fix) = on a set of nonzero measure. On 
the other hand, if there is A of full measure such that /\a is injective, but it does not hold that 
f'(x) > for a.e. x, then by Lemma |A. II we again have that /^Leb is not absolutely continuous 
with respect to Lebesgue measure, so again both sides of (j6.2|) are +oo. □ 



Proof of Corollary \6.4\ For f,g £ Go, the unique geodesic between them is given by 

h := (l-t)f + tg, 

by Lemma 14.21 Thus, 

Ent(*(fet)|Leb) = - f log((l - t)f'(x) + tg'(x))dx 
Jo 

<- f ((l-t)log(f'(x))+tlog(g'(x)))dx 
Jo 

= (1 - t)Ent(*(/)|Leb) + 1 Ent(W(#)|Leb) 
where we have used the concavity of log in the second step. □ 

6.2. Optimal Transport and Displacement Convexity in Riemannian Manifolds. To 

prove Theorem l6.2t we must first understand optimal transport on Riemannian manifolds. With 
this in mind, we fix a smooth compact Riemannian manifold (M, g), and write d for the induced 
length metric. 

Definition 6.5 (d 2 /2-concavity). A function <f> : M — > R is called d 2 /2-concave if there exists 
a function ip : M — >■ R so that 

1 



(j)(y) = inf 



2 d(x,y) 2 - ip(x) 



d 

for all y G M. For a function ip : M — > R, we define its d 2 /2-transform <ft~ by the same 
formula 

d 2 1 

cj)-2(y) ■= inf [-d(x, y) 2 - <f>(x)}. 

We will need the following properties of c? 2 /2-concave functions 

Proposition 6.6. A d 2 /2-concave function <f> is Lipschitz (and thus its gradient exists almost 
everywhere) and has an almost everywhere defined Hessian in the sense of Alexandrov. 

Here, we say that <f> admits a Hessian at x in the sense of Alexandrov if <fi is differ entiable 
at x and there is a self-adjoint operator H : T X M — > T X M satisfying the property that for all 
v G T X M 

V^V^Iz = Hv\ x 
where V v denotes covariant derivation in the v direction. 

This is proven in |10| or in [37\ Theorem 14.1]. The authors of |10j go on to prove in Corol- 
lary 5.2 the following characterization of Wasserstein geodesies between absolutely continuous 
measures on Riemannian manifolds. 

Theorem 6.7. For p, v G V(M) with p <C voIm; there is a d 2 /2-concave function <ft : M — > R 
so that the map 

F t {x) := exp x (—tV(p) 
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gives fit ■= : [0, 1] —> V{M), which is the unique geodesic between fi and v . Furthermore, 

(Id, i*x)*//o is an optimal transport plan between fiQ and fi±. For t E [0, 1) fit *C voIm and if, in 
addition, v <C vo1m> then we have that for all t G [0,1], ^ < voIm . 

This was originally proven by McCann in [27]. In particular, if we let V ac (M) C V(M) denote 
the subset of probability measures which are absolutely continuous with respect to voIm, then 
V ac (M) is a dense, totally convex subset of V(M). 

Proof of Theorem \6.2[ Assuming Ric > K: 

Lemma 6.8. For : M a d 2 /2 -concave function, define 

F t (x) := ex Vx (-tV4>) 

and assume that we have chosen y £ M such that 

(1) (p admits a Hessian at y (in the sense of Alexandrov) 

(2) Ft is differentiable at y for all t E [0, 1) 

(3) dF t (y) is nonsingular for all t G [0, 1). 
then V(t) := det(dF t (y)) 1 /™ satisfies 

^<-lRic(F lW ,F l(! ,)>. 

It will be important for later to notice that by Proposition 16.61 and [XQ^ Theorem 4.20 , 
assumptions (1), (2), and (3) are satisfied for a.e. y £ M. 

Proof. We fix a y with the above properties, and define a geodesic 7 : [0, 1] — > M by 

7(t) := exp y (-tV0(i/)). 

Choosing an orthonormal basis e\,... ,e n for T y M with e\ = t(0) /I't(O) | = — Vcp(y)/\V(p(y)\, 
we parallel transport the basis to a orthonormal frame e\(t), . . . , e n (t) along 7(i). 
Defining 

Ji(t) := (dF t ) y (ei) = exp^+^^^tV^^ + exp^tfe,)), 

this is a vector field along j(t) coming from a variation through geodesies, and is thus a Jacobi 
fieldJl Thus, for t G [0, 1), we can define a matrix Jij(t) := (Ji(t),ej(t)) and because the Jj are 
Jacobi fields, we have that by the Jacobi equation, (jB.lOp 

J(t) + R(t)J(t) = 

where 

R kj (t) := (Biem(e fc (*),7(t))7(t),e j (t)). 

Furthermore, notice that 

Ji(0) = -^\ x _ n Fo(y + exp y (5ei)), 



so 



and 



6=0 
J(0) = Id 



d 

It 
d_ 

dS 



M0) = ^ t ^\^F t (y + e Wy (6e t )) 
Ft(y + exp y (5ei)) 



<5=o dt 



d_ 

d6 



5=0 



t=o 

V0(y + exp J5ei)) 



This theorem provides the a.e. nonsingularity of dFt, i.e. assumption (3) 

classical reference for Jacobi fields is Milnor's 28 , but 29, Chapter 5] and [151 Section 3.C] also contain 
useful introductions to the topic. We give a brief overview of Riemannian geometry, including Jacobi fields in 
Appendix [B] 
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= -v ei v<^ 

= -Hess(0)| y e;. 

Here we have written V ei for the covariant derivative in the i-th direction. Thus, 
^(°) = J t \ t=0 < J ^)> e iW> = (ji(t),ej(t)) = - (V ei (V0), ei ) 

so we can write 

j(0) = -V(V0). 
Letting ?7(i) := J(t)J(t)~ l , we thus have that 

(6.3) U(t) = J{t)J(ty l - j{t)J{t)- l j{t)J{t)- 1 = -R(t) - U(t) 2 . 
Taking traces, we thus have 

(6.4) j t tr{U{t)) + tr{U{t) 2 ) + tr R(t) = 
and it is clear that 

(6.5) trfl(t)=Ric 7(t) (7(t),7(t))- 

We claim that U(t) is symmetric. To see this, notice that 17(0) = j(0)J(0)" 1 = -V(V0), 
which is a symmetric matrix, by definition, and both U and its transpose satisfy the first order 
system of ODE's (|6.3p . with the same initial conditions, so they must be equal. This allows us 
to note that 

TL TL / Tt *\ 

tr(U(t) 2 ) = UyWji® = E(^i(*)) 2 > ~ = \tr(U(t)f 

i,j=l *,i=l \i=l / 

so, combined with (|6.4p . this gives a differential inequality for trU 

(6.6) 4- te(U(t)) + - tr(U{t)) 2 + tr R(t) < 0. 

at n 

Now, letting 

J{t) := det J(t), 

we have that by assumption (3) in the statement of the lemma, J is invertible for all t, so it is 
a standard fact that 

(6.7) j(t) = J(t) tr( j(t) J(ty l ) = J(t) tr U(t), 
so 

±tr(U(t)) = -( ^ {t) ^ - J{t) ^^ ~ ( ^ (t))2 



dt x wy dt yj(t) J {J{t)) 2 

and 

i(,, ( t/«)) 2 - m? 



n y v v " n(J(t)) 2 
so these combine with (|6.5p and ()6.6p to give 



(0.8) l-Hlw+W^ 1 
Now, to finish the proof, notice that T>(t) = J(t) l l n , so 

(6-9) 4£ = ^ 

V(t)V(t) - (V(t)) 2 J(t)J(t) - (j(t)) 2 



n- 



(v(t)f (j{t)f 

This implies that 

v{t) j(t) / i\ (j(t)) 2 



n 



X>(t) J(t) 
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and noticing that j(t) = Ft(y), this finishes the proof. □ 

For no, [ii € V(M) with Ho, Hi ^ m = (vol(M))~ 1 voljv/ , let <p be a ^--concave function such 
that the map 

F t (x) : = exp x (-iV£) 

gives Ht '■= (Ft)*Ha '■ [0, 1] — > M, which is the unique geodesic between ho and Hi- By Theorem 
16.71 Ht *C m, so we can define r]t by Ht = f]vm. Then, by the nonsmooth change of variables, 
Lemma lA.21 we have that the densities obe3tj 

(6.10) no(x) = m(F t (x)) det{dF t (x)). 

Using a nonsmooth change of variables again, combined with (|6.10p gives 

Ent(//f|m) = / r]t(x) log rjt(x)dm(x) 
Jm 

log(r) t (F t (x)))ri t (F t (x))det(dFt(x))dm(x) 

M 

lo S ( a vT J Vo(x)dm(x). 

M \det(dF t {x)) J 

Thus, defining 

C(x,t) :=logdet(dF t )(x) = nlogV(t), 
we can rewrite the previous equation as 

(6.11) Ent(/ii|m) = / (log(r/o(a;)) — C(x, t)) r]o(x)dm(x). 

Jm 

Because we've assumed that Ric > K, for almost every x £ M we have that 

C(x,t) = n V{t)V ^ {t ~ m2 <ng£j <-Ric(F t (x), F t (x)) < -if|^(x)| 2 = -i^|V0| 2 (x) 
which implies that 

C(x,t) + -^|V0| 2 (x)j <0, 



so it is concave. Thus 

C(x, t) + ^K\Vcp\\x) > t (C(x, 1) + ^K\ Vct>\ 2 (x)j + (1 - t) (C(x, 0)) 

C(x, t) > tC(x, 1) + (1 - t)C(x, 0) + —t(l - t)\ V(f>\ 2 (x). 
Inserting this into (|6.1ip gives 

K f 

Ent(Ht\m) < tEnt(m\m) + (1 - t) Ent(/i |m) - —t(l - t) \V(j)\ 2 (x)r] dm. 

2 Jm 

Furthermore, as d(x, Fi(x)) = \ V(j)\(x) and because the transport plan (Id, -Fi)*/io is optimal by 
Theorem 16.71 we have that 



(6.12) d W (Ha,Hif= I d(x,Fi(x))dHo(x)= [ |V0| 2 (z)^ o (x) = f |V0| 2 (x)r ?o (x)dm(x) 
Jm Jm Jm 



and thus 



K 

Ent(Ht\m) < iEnt(^i| m ) + (1 - t) Ent(/i |m) - —t(l - t)d W (ho, Hi) 2 , 



as desired. 

Assuming that Ent(-|m) is weakly a.c. displacement K-convex: Fix x 6 M and w £ T X M. 
We will show that Ric(-u,w) > iTl?;^. Choose a smooth positive function 770 with 770 > in 
Bs(x) and supported in B2s(x) for a (5 > which we will fix later. Furthermore, we will assume 
f M r]o(x)dm = 1. Thus, letting ho '■= Vo m , this gives ho £ V{M) and /Uq <C m. Now, take 



^The cited lemma requires almost everywhere injectivity of Ft, which is established in [101 Lemma 5.3]. 
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a smooth function <j) : M — > K with V^(x) = —v and A(j)(x) = (for example, in normal 
coordinates around x such that v = d±, multiplying —x\ by a smooth cutoff function that 
vanishes away from x satisfies this requirement). For e > 0, small enough e<j) is ^--concave by 
[37] Theorem 13.5]. This implies that if we define F^{x) := exp x (— eV^(x)), Theorem 16.71 gives 
that fit := (FD^fiQ is the unique optimal transport between fiQ and fi\ := (Fl)^fiQ. By making 
e smaller if necessary, we have that fi\ <C m (this follows from Theorem 16.71 because we know 
that the original fit <C m for t G [0, 1)). Because we have assumed that Ent(/Xt|m) is K-convex, 
it is not hard to see that this implies that 

t 2 \" 
Ent(fi t \m) + -Kd w (fi ,fi 1 ) 2 j >0 

if it is differentiable in t (which it is, in this case). Using (|6.12p . we have that thus 



dt 2 



2 r / ,9 \ ill 



f M ^-Ke 2 \V^\ 2 {x)-C(x,t)^ 



rio{x)dm(x) 



> 



Because everything is smooth, we can differentiate under the integral and then letting 5 and 
then e tend to zero, this implies that 

(6.13) - C(x, 0) > e 2 K\V(j)\ 2 {x). 

However, for any fixed e, 5, we have that, as in the above proof 

Now, reminding ourselves of the definition of T> and U, we see that, by (|6.7p and (|6.9p 

n W) = W) = trC/(0) = - trV ^))^ = = °- 

This, combined with the fact that we have equality in ()6.6p at t = (again because tr U (0) = 0) 
gives that 

C(x,Q) = -e 2 Ric :r (-V^,-V</>) 

so 

-C(x,0) = e 2 Ric x (v,v) 
and this, combined with (|6.13p . yields the desired 

Ric x (v,v) > K\v\ 2 . □ 



7. Gromov-Hausdorff Convergence 

7.1. Definition and Basic Properties. One of the interesting properties of our new notion 
of Ricci curvature bounds for metric spaces, as in Definition 16.11 is that it is preserved under 
a reasonably weak notion of convergence of metric measure spaces, called Gromov-Hausdorff 
convergence. 

Definition 7.1. For X,Y compact metric spaces and e > 0, we say that a (not necessarily 
continuous) map f : X — >■ Y is an e-isometry if the following properties hold 

(1) forx,x'eX, \d x (x,x')-d Y (f(x),f(x'))\<e 

(2) for all y EY there is x £ X with dy(y, f(x)) < e. 

Definition 7.2. A sequence of compact metric measure spaces (X n ,d n , fi n ) converges in the 
(measured) Gromov-Hausdorff topology to a compact metric space (X,d,fi), written 

(X n ,d n ,fi n ) >■ (X,d,fi) 

if there is a sequence e n — > such that there are e n -isometries, f n : X n — > X such that 
(/n)*A*n I 1 (in the weak* topology on V(X)). 
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We will say that (X n ,d n ) — > (X,d) in the (measureless) Gromov-Hausdorff topology if the 
above holds without the condition that the e n isometries push any measures forward. We note 
that if (X n ,d n ) are (compact) geodesic spaces, then so is (X, d). To see this, recall that by 
Lemma 12.11 a necessary and sufficient condition for a space to be a geodesic space is that 
midpoints exist. Taking x,x' E X, there are x n ,x' n G X n such that 

d{fn(x n ),x),d(f n ( X n ), X ) < € n . 

Because (X n ,d n ) is a geodesic space, by assumption, Lemma 12.11 shows that we can find a 
midpoint z n G X n such that d n (z n ,x n ) = d n (z n ,x' n ) = ^d n (x n ,x' n ). Because /„ is an e n 
isometry, we then have that 

\d[fn{Xn)i /n(-2n)) d n (x n , Z n ) | ^ 6 n 

Now, by extracting a subsequence we may assume that f n (z n ) converges to z G X (because X 
is compact), and thus, passing to the limit in the above gives 

d(x,z) = lim d n (x n ,z n ) = lim \d n {x n ,x' n ) = -d(x,x'). 

n— yoo n— >oo 2 Z 
A similar consideration for z and x' shows that z is a midpoint for x and x', so (X, d) is a 
geodesic space. 

We quickly give a few examples of Gromov-Hausdorff convergence (without proof as most of 
these examples are geometrically believable but actually proving them would be quite tedious 
and not particularly instructive). For (X, d) a metric space of bounded diameter, we have the 
convergence 

(X,Xd) ^({*},d,) 

as A — > (where ({*}, d*) is the metric space with one point). A slightly more exciting example 




Figure 7. One of the simplest examples of a singularity developing under 
Gromov-Hausdorff convergence. The metrics on the spaces are all the induced 
length metric from the ambient M 3 length structure, i.e. the distance between 
two points is the infimum of the lengths of curves in the surfaces between two 
points (as measured in M 3 with the standard metric). 

is given in FigureO in which we see hemispheres embedded in M 3 developing a conical singularity 
under (measureless) Gromov-Hausdorff convergence. It also turns out that topology need not 
be preserved under Gromov-Hausdorff convergence. Figure [8] gives an example of a small handle 
shrinking away on a sphere. All of the original spaces are topologically tori, but the limit space is 
not. Finally, we mention an example in which the dimension decreases in the limit. Recall that 
the Hopf fibration can be thought of a Riemannian submersion, <S 3 (1) — > S 2 (^) (where <S 2 (^) 
is the standard 2-sphere of radius ^. For e > 0, we define a new metric on 5 3 by declaring that 
(what were originally) unit vectors tangent to the Hopf fibers have length e, and unit vectors 
perpendicular to the Hopf fiber have length 1. This results in a 1-parameter family of metrics 
on giving rise to what is known as Berger spheres; cf. |29} Example 11]. It turns out that 

as e — > 0, (S 3 ,g e ) S 2 (^), so we see that the dimension can decrease under convergence 
io A 

simpler example of decreasing dimension under Gromov-Hausdorff convergence is given, for example, by 
the squares [0, 1] x [0, e] C K 2 with the induced length metric converging to the interval [0, 1] as e — > 0. Dimension 
can also increase under convergence, as is illustrated by a finer and finer grid in [0, l] 2 (with the induced length 
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Figure 8. Spheres with a small handle converging to a sphere with no handle 
in the Gromov-Hausdorff topology. This shows that the topologies of the limit 
space may be different from the spaces in the sequence. 



We will not dwell on further properties or examples of Gromov-Hausdorff convergence. A 
quick introduction to Gromov-Hausdorff convergence can be found in [11] and a more detailed 
exposition can be the found in the book by Burago, Burago, Ivanov [5j. Gromov's book, [T7] is a 
more difficult read, but contains numerous fascinating examples and remarks and we recommend 
it highly. 

7.2. Stability of Weak Displacement Convexity. One important property of our new 
definition of lower Ricci curvature bounds is that they are preserved under Gromov-Hausdorff 
convergence. We will give an overview of the proof, relying on several results from [26], those 
concerning Gromov-Hausdorff convergence. 

Theorem 7.3. // (X n , d n , /%) > (X,d,p) and (X n ,d n , p n ) all have Ent(-|/i n ) weakly a.c. 

K -convex, then the same holds for (X, d, p) . 

We remark that we have already seen this phenomena in our above examples. For example, 
in Figure [7J the sequence all has nonnegative Ricci, showing that the cone has generalized 
nonnegative Ricci. Similarly (cf. \29\ Section 4.3]), one can show that for K < 4 there is e small 
enough so that Ric(g3 )ffe ) > K and clearly S 2 (^) has Ric > 4. 

Proof. Let f n : X n — > X be e n -isometries with e n — > 0. Take vo,vi G 'P(X) with v$,v\ <C p. 
We can write i>{ = pi(x)p. Without loss of generality, by [26} Lemma 3.24], we may use an 
approximation argument to assume that the pi are continuous. Thus, we have that 



1 = / pi{x)dp{x) = lim / Pi(x)d[(f n )*p n ](x) = lim / pi{f n (x))dp n {x). 
Jx n ^°°Jx n ~^ 00 Jx n 

This implies that for large enough n, defining functions 

~(n) ._ Pi{fn{x)) 



Sx n PiUn{x))dn n {xY 



gives v\ := p\ {x)p n £ V(X n ). Thus, by assumption, there is a geodesic in V{X n ) 
between and p| n ^ such that 

(7.1) Ent(£ t (n Vn) < tEnt(pJ n) |/in) + (1 - t)Ent(^Vn) + yt(l - t)d w {y^\ v^f ■ 

Now, to complete the proof, we will pass to the limit as n — > oo. First, we need that 
(V(X n ),d w ) ^> (V{X),d w ), which allows us to find a geodesic in V{X) which is the "limit" 
if u[ n ^ in the appropriate sense. 



metric) converging to [0, l] 2 . However, with control over the Ricci curvature of the sequence of spaces, the 
dimension of the limiting space cannot increase (this is discussed in [6]). 
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Lemma 7.4. The induced maps (/«,)* : V(X n ) — > V(X) are e n isometries where 



l n := 4e„ + v / 3e„(2diamX) +3e n . 

This is proven in [261 Proposition 4.1]. As a result of this, up to extracting a subsequence, 
the (possibly discontinuous) maps t i— > (fn)*^t tend uniformly (this notion makes sense even 
if the maps are not continuous) to a geodesic vt £ ~P(X) between v§ and v\. The proof of this is 
basically the same as Arzela Ascoli, and is a generalization of the proof of the "isometry lemma" 
in [16] on page 66. Notice that this is exactly why have assumed weak displacement convexity. 
It is not clear that we can find geodesies in V(X n ) which converge to any geodesic between i/q 
and v\ as above. There are examples of Gromov-Hausdorff convergence of length spaces where 
not all geodesies in the limiting space come are limits of geodesies in the original spaces 

Thus, given that (fn)*^t ~~ ^ v t uniformly, we would like to take the limit in (|7.ip . By 
Lemma El d w (D^ n) ,i)[ n) ) — > d (vq, Furthermore, for i = 0, 1 

Ent(z>| n) |/Xn) = / pf l \x)\ogpf\x)dp n {x) 

Pl{Ux)) ■ log ( r p ;[ u ^] , , ) d» n ( x ) 



X, 



Ix n Pi(fn(x))dfJ, n (x) yf Xn Pi(fn(x))dfX n (x) 

Pi( x ) , ( Pi( x ) \ jut \ U \ 

log T — d[(f n )*lin](x) 



X I Xn Pi(fn(x))dfJ, n {x) yj Xn pi{f n {x))dfl n (x) 

► J ^^y^ log ( — ^\dn{x) 

= F,nt(ui\p) 

This, combined with (|7.ip shows that 

t \ K 
liminf Ent(PrVn) < tEntivAp) + (1 - t) Entfi/ol/i) + —til - t)d v [uq,^) 

Finally, to conclude the desired inequality, it is enough to show that 

(7.2) Ent(i/ t |/i) < liminf Ent(P t (n Vn) 

n—yoo 

by general functional analysis arguments combined with the Legendre transform representation 
of the entropy functional. We give a quick overview of this argument, which is a vastly simplified 
version of the proof of [26} Theorem B.33]. The proof is somewhat obtuse and could certainly 
be skipped without much harm. For a compact metric space (Y, d) with probability measure to, 
we claim that for any measure pm which is absolutely continuous with respect to m, we have 
the representation 

(7.3) Ent(pm|m) = sup I / (pip — e ip ~ 1 )dm 
To see this, first, notice that 

a ln a > ab — e fe_1 



standard example of Gromov-Hausdorff convergence in which not all of the geodesies in the limit space 
are limits of geodesies in the converging spaces is as follows (this is 37, Example 27.17], but I do not know of 
its real origin). Consider the metric spaces (M",d eP ), where d eP is induced by the norm ||a:||£j> = (5^™=i l a; | p ) 1/ ' P - 
It is not hard to see that as p — > oo (R n ,d £P ) GH ) (M. n ,d e ) where d l is induced by ||a;|| = sup 1<i<n \xi\. 
When p < oo, geodesies are simply straight lines between the two endpoints, but for p — oo "taxi cab" paths, i.e. 
straight lines in one coordinate and then another and so on are also geodesies and it is not hard to see that the 
majority of geodesies for p = oo are not limits of p < oo geodesies. I do not know an explicit example where we 
have the case we are worried about, i.e. (Xi,di) GH ) (X,d) but there are geodesies in V(X) that are not limits 
of geodesies in V(Xi), but it seems likely that with some work the previous example could be used to give such 
an example. 
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for a,b £M, which can be seen by maximizing the right hand side in b, so letting a = p(x) and 
b = tp(x) and integrating with respect to m gives that 



Ent(pm|m) > sup 



yj (pip - eV- l )dm)j . 



On the other hand, define, for M £ N 

' M p(x) > M 



Pm{v) ■-- 



p(y) pe [£,M] 
,JI P(V)<TJ 



and let 

Tp M (x) = log p M + 1. 

Finally, it is standard that there are continuous (pM,k 6 CPO which have tpM,k(y) — > log Pm(u) + 
1 for m-a.e. |/ £ 7. Thus, by dominated convergence 

lim lim / (pipMk — e ipM < k ~ 1 )dm = lim / (p(log pjvf + 1) — pM)dm = / plogpdm, 

M— >oo k— yoo Jy M— »oo Jy Jy 

for e > 0, we see that we can take large enough M and then k such that 

{pfM,k ~ e' PM ' k ~ 1 )dm > Ent(pm|m) — e. 



This establishes (|7.3p . This seems like a rather useless representation, but in fact it allows us 
to show (|7.2p . To do so, we must first make three observations about this new representation. 
First of all, notice that we can rewrite (|7.3p as 



Ent(/3|m) = sup ( / tpdp, — 1 dm 
•fi&c(Y) \Jy Jy 



and if /3 is not absolutely continuous with respect to m, then both sides are infinite (take tp 
tending to 1 + lnM %Ai where 13(A) > but m(A) = and then let M — > oo). Secondly, notice 
that we also can take the supremum over L°° 

Ent(/3|m) = sup I / tpdp, — / e?~^dm 
<peL°°(Y) \Jy Jy 

Finally, for a fixed tp G C(Y), the map 

G8,m)i-> J tpd/3- J e v-1 dm 

is a continuous functional on C(Y)* © C(Y)*, so it is standard that the supremum is lower 
semicontinuous in both variables. Thus, we can now establish (|7,2p . because this shows that 

Ent(u t \p) < liminf Ent((/ n )*i> 4 (n) |(/ n )*/i n ) 

n— >oo 



= liminf sup ( / tpd(f n ) ilt i>f ) - / e v 1 d(/ n )*/i„ 
n - > °° tp€C{X) \Jx Jx 

= liminf sup ( / tpd(f n )^v { f ) - I e lfi ~ 1 d(f n )^p n ) 

n ^°° <peL°°(X) \JX JX J 

= liminf sup ( / tp o f n dv\ n ^ — I e' fi0 ^ n ~ 1 dp n ) 

n ^°° ^GL°°(X) \JX n Jx n J 

< liminf sup ( / ipdVf — / e' fi ~ 1 dp n J 
(n). 



Ent(z> t l Vn)- 
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In the first inequality, we used the fact that we have shown that Ent(-|-) is lower semicontinuous 
in both variables, and in the second inequality that {99 o f n : ip £ L°°(X)} C L°°(X n ). This 
completes the proof. □ 

Thus, because we see that generalized Ricci lower bounds are preserved under Gromov- 
Hausdorff convergence, this theory gives a partial answer to the natural question posed by the 
following theorem of Gromov, which says that a set of Riemannian manifolds of a fixed dimen- 
sion, with a uniform diameter upper bounds and Ricci curvature lower bounds is precompact 
in the Gromov-Hausdorff topology. 

Theorem 7.5 (Gromov's Compactness Theorem). If (M k ,dk, fJ>k) is a sequence of compact 
Riemannian manifolds (where dk is the induced length metric and \x k = {^o\{M k ))~ l \o\M k is 
the normalized volume measure) with dimM^ = n, RicM/% > K and diamM& < D, for a fixed 
n, K G K, D > ; then there is a subsequence (M k . , d kj , fJ>kj ) an d a metric measure space 

(X,d,fjt) such that (M kj , d kj , fi kj ) > (X,d,(i). 



This is proven in, |29l Corollary 30]. This theorem leads to the natural question of what 
properties are inherited by a geodesic measure space in the closure of the set of such manifolds. 
We have just shown that the limit spaces have the same lower bounds for Ricci (in the weak 
a.c. displacement convexity sense we have developed above). Further properties of these limit 
manifolds can be found in Section IC.21 

7.3. Approximations of Vq. The notion of Gromov-Hausdorff convergence allows us to make 
finite dimensional approximations of (Vo,d w ). In this section, we will work with measure- 
less Gromov-Hausdorff convergence (i.e., drop the the associated measures, along with the 
requirement that (f n )*fJ, n — > //), and then later we will use our results to discuss measured 
Gromov-Hausdorff convergence! 12 ! 

We construct a sequence of metric spaces converging to (Go,d L2 ) by associating to each 
partition 

{U}f =1 = {0 = t < h < ■ ■ ■ < t N < t N+1 = 1} 
a metric space (X ({ti}) , dtj and a map 

f^ :(X({t t }),d u )^(g ,d L2 ) 
which is an isometry onto its image. To do this, we define the set X({ti}f =x ) to b^B 

(7.4) S w := {(xt, x N ) £ [0,1} N : = x < xi < ■ ■ ■ < x N < x N+1 = 1}, 
and then define a map 

N 

(7.5) f {u] {x u ...,x N ) = ^2x iX (u,u+i] e So 

i=i 

Notice that 

N 

||/ {ti} (xi 5 ...,x N )- f {u} (yi, . . . ,y N )\\ 2 L 2 = \ x i ~ Vil^i+i ~ U\, 

i=i 

so because of this we define 

N 

d^((x), {y)f := \xi ~ Vi\ 2 \t l+ i - U\ 

8=1 

and clearly this turns /** into an isometry onto its image. 



12 We do not sacrifice much generality by considering measureless Gromov-Hausdorff convergence first, as 
Prokhorov's theorem guarantees that for any sequence of fj, n associated to the (X n ,d n ), some subsequence of 
(/n)*M« converges. 

^Notice that X({ti}) does not actually depend on the actual partition, but instead just on the number of 
elements in the partition. However, we label it this way because it will rapidly get confusing when we have 
different partitions in one expression and hopefully this should make things slightly simpler. 



28 OTIS CHODOSH 

Proposition 7.6. For any sequence of partitions o/[0, 1], {i^ li^i such that the longest interval 
in the k-th partition shrinks to zero 

S k := sup (tgi - *S fc) ) 0, 

0<i<N k 

we have the Gromov-Hausdorff convergence 



Proof. To show Gromov-Hausdorff convergence, it is sufficient to show that there are — > 
such that the /*« are isometries, i.e. for all (x), (y) G X({t^}), 



| ||/^ ) ((x))-/^ ) (( y ))|| L2 -^ fc) ((a;),( y ))|<e, 
and for any g £ Go there is (a;) G -X"({i^}) such that 

llff-/* <W ((a?))llLa < 
.CO 

The first property is clearly satisfied because / » is an isometry onto its image, so we will 
establish the second property. For any g £ Go, we have that 

N rt W 



g - f?\g tl , . . . ,g tN )f L2 = £ /"^ (<?(*) " >)) 

i=0 ^ 



i=0 " l < 
2V 

i=0 



< sup (i™ 
V 0<i<N 



\ JV 
/ i=0 

i=0 



0<i<JV 

= { sup (e -*?')) 

= 4 

Thus, = 2\f5k satisfies the above requirements. □ 

We will often refer to the "standard" partition, defined by 

f 12 2^ — 1 1 

(7-6) T$ d = |0 = t < w < ^ < ■ ■ ■ < < V = 1 j . 

Writing (X^,^) for the spaces associated with T/v as in the above, Proposition 17.61 shows 

that (X% d ,d N ) ^ (Go,L 2 ). 

It is instructive to relate the form of the efc-isometries in Proposition 17.61 i.e. the maps 

f** : -> Go 

to the composition 
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.(AO 

where is the map coming from Proposition 14.11 because we clearly have that ^ o f l i are 
efc-isometries. It is not hard to see that 

N 

^o/« (x-i, . . . ,x N ) = y \t i+ i - tj)8 x .. 

This somewhat motivates the following proposition, which we will not prove, as we will not 
make use of it later. 

Proposition 7.7. Letting [0, l] n /6 n denote n-tuples of points in [0,1], quotiented out by per- 
mutations and d n the quotient metric coming from the euclidean metric on [0, l] n , then the 
maps 

f n :([0,l} n /e n ,d n )^(V ,d W ) 



1 

{xi, . . . , x n } i-)- - V" 6 X 



n 

i=l 



are e n -isometries with e n — > 0, so 

({0,l] n /6 n ,d n )^(V ,d" 
Work relating this notion and ideas discussed in the next section can be found in [3]. 

8. Measures on Vo and Associated Ricci Curvature Bounds 

From Proposition 15.31 we know that (Vo,d w ) (or equivalently {Qo,d L2 )) has nonnegative 
(actually vanishing) sectional curvature (in the generalized sense) . By analogy with the classical 
Riemannian case, we expect that this implies that it has nonnegative Ricci curvature in our 
new generalized notion. However, to make sense of this, we must choose a reference measure, 
and there is not really an obviously good choice. For example, Petrunin has shown in [30J that 
if a space with nonnegative sectional curvature in the sense of Alexandrov has finite Hausdorff 
dimension, then the space with the appropriately normalized Hausdorff measure has nonnegative 
Ricci curvature^ However, it seems that there is no useful notion of infinite dimensional 
Hausdorff measure, to try to generalize this result to the (Go,d L ) case directly. On the other 
hand, von Renesse and Sturm, in [38J, have constructed a reasonable candidate measure, which 
we will refer to as the entropic measure. However, as we will see, it suprisingly does not admit 
lower Ricci bounds at all. 

8.1. Analytic Consequences. We are not just interested in a measure with lower Ricci bounds 
on {Go,d L ) for its own sake (although, it is certainly a natural question that we feel is worth 
studying for its own merit). One reason why we might be interested in this is that there are 
useful analytic consequences of having lower Ricci bounds, including log Sobolev, Talagrand and 
Poincare inequalities. We will briefly discuss some of these inequalities. The following theorems 
are proven |26j in the setting of generalized lower Ricci bounds. 



Theorem 8.1 ( |26| Corollary 6.12). For (X,d,v) with generalized Ric > K, if f, a Lipschitz 
function on X satisfies J x f 2 dv = 1, then 

(1) IfK>0 then 

(8.1) J f log(f)du < -| J \Vf\ 2 dv. 

(2) IfK<0 then 

(8.2) J f 2 \og{f 2 )dv<2&mm(X)JJ \ V f\ 2 dv - diam(X) 2 . 



14 Actually, in this paper, Petrunin proves a stronger result, saying that if the space is m-dimensional with 
nonnegative sectional curvature, then the space has nonnegative generalized m-dimensional Ricci curvature, a 
notion which is discussed both in [26] and in [34| . In particular, the version we give in the text follows from the 
"Main Theorem" in [30] and [3l Proposition 1.6]. 
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Note that in the above, we have defined 

1*7*12/ \ i- \f( x ) - f(y)\ 

|V/| (*) :=hmsup ^ 

which clearly agrees with the standard notion of the norm of the gradient on a manifold and for 
differentiable f . 

As a consequence of (|8.1|) . it is possible to derive a Poincare inequality for geodesic measure 
spaces with generalized positive Ricci curvature. 

Theorem 8.2 ([26] Theorem 6.18). // (X, d, v) has Ric > K for some K > 0, for all Lipschitz 
f on X with J x fdv = we have that 

(8.3) / h 2 dv < \- f \Vh\ 2 dv. 

Jx K Jx 

Log-Sobolev and Poincare inequalities have proven useful in a wide variety of applications. 
For example, [36] describes applications of log-Sobolev inequalities to functional analysis on 
manifolds, information theory, hypercontractivity estimates, concentration of measure, and en- 
tropy production in dissipative equations. Furthermore, Poincare inequalities have become 
replacements for "spectral gap" inequalities when there is no Laplacian available ([37, Chapter 
30]). This gives some indication that lower Ricci bounds would have interesting implications 
for analysis on (X, d, v). 

8.2. Failure of Ricci Bounds for the Entropic Measure. Von Renesse and Sturm con- 
struct a measure Qq on Qq in |38|. Proposition 3.4]. First, we will describe the construction 
and then explain why one might expect that (Qo,d L2 , Qq) nas nonnegative generalized Ricci 
curvature (or at least Ric > K for some if gK), which we will then show fails to be true. 

Proposition 8.3. For (3 > there is a (unique) probability measure Qq E V(Qq) which we 
will call the entropic measure (but is also known as the Dirichlet process,) such that for each 
partition of [0, 1] 

= to < h < ■ ■ ■ < t N < t N+ i = 1 
and for all bounded measurable functions u : [0, 1]^ — > R, we have that 



N 

u(xi, ...,x N ) Y\ix i+ i - Xif^+^^dxx ■ ■ ■ dx N 



(8.4) / u(g(t 1 ),...,g{t N ))d®Z(g) 

Jg 



i=0 



where we define S^r as in (|7.4j) by 

Sat := {(xt, x N ) E [0, 1]^ : = x < x\ < ■ ■ ■ < x N < x N+1 = 1}, 
and r(s) = Jq 00 t s ~ 1 e~ t dt is the Gamma function. 

Furthermore, for each partition of [0,1], {t^fL-^, defining a measure /^{tj on (X({ti}, d^^) 
(these spaces are defined in Section pO|) by requiring that for u E C(X({ti})) 



1.5) / u{xi, . . . ,x N )dn {t .\ 
Jxau}) 



/ u(xi,...,x N ) ~[\(x i+1 -Xi)^ u+1 u) 1 dxi---dx N 



and writing /u^v ols the measure associated to the standard partition, as defined in (|7.6p . we have 
the measured Gromov-Hausdorff convergence 

(8.6) (X s N u ,d N , m ) {Go,d L \^). 
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Proof. Equation (|8.4p certainly does not define how to integrate against all continuous functions 
on Go- To show that Qq is well defined and unique, von Renesse and Sturm, in [38], make use 
of the Kolmogorov extension theorem. We give a more direct proof by first showing that there 
can be at most one measure satisfying fj8.4[) and then showing that the pushforward of the 
/i jv forms a Cauchy sequence in (V(Go),d w ) (note that here, in an abuse of notation, d w is 
Wasserstein distance on V(Go))j with its limit satisfying (|8.4p . This will complete the proof of the 
proposition, because we have already shown the (measureless) Gromov-Hausdorff convergence 

(X N ,d N ) ^ {Qo,d L2 ) in Proposition EH 



Lemma 8.4. There can be at most one measure satisfying ([87 
Proof. In the proof of Proposition 17.61 we showed that for g G Go 

\\g-f N (g(l/2 N ),...,g((2 N -l)/2 N ))\\i 2 <± 

where / is the map defined in (|7.5p . associated to the standard partition. This shows that for 
any continuous u G C(Go), u o f N — > u uniformly on Go- Thus, if Qq exists 



u(g)dq p Q (g) = Jim / u o f» (g)dQ^ (g), 



g N ->°°jg 

and assuming that Qq satisfies (|8.4|) . the right hand side of this equation is fixed for any N, 
and thus because the limit exists, the limit is uniquely specified. □ 

Proposition 8.5. The push-forward measures associated to the standard partitions 

{(f N )*(J>N}N=l,2,... 

form a Cauchy sequence in (V(Go)>d w ) (where d w is the Wasserstein distance on V(Go))- 

Before proving this, we recall the following integral identity, which plays a crucial role in the 
existence of Qq. We follow the proof given in [201 Section 1.5]. 

Lemma 8.6 (Euler's Beta Integral Identity). For real numbers a < b and a, f3 > 0, we have 
the following identity 

\t - a) a - l (b - tf~ l dt = (a - br+P- 1 . 

T(a + p) 

Proof of Euler 's Beta Integral. By using the change of coordinates 

t = a + (a — b)t, 

it is sufficient to show that 

(8.7) /"V 1 (l-^- 1 cft- r(o)r03) 



o T(a + (3) 

To show this, first recall that one standard representation of the T function is 



poo 

r(x) = / t x - l e- l dt 

Jo 



and making the change of coordinates t — > rt for r > 0, we have that 



t x ~ l e- rt dt. 



o 



We introduce the variable u = j^, and see that 



1 roc u a-l 



(8.9) / t«~\l-tf- 1 dt= I - - j du 



o (l + «) c 

f\ i /*oo / roc \ 

/ t a -\l - tf~ l dt = - - / ( u a ~ l / t a+ P- l e-( l+u ^dt ) du 
Jo V ' r(a + /3)7 V Jo J 



Using p.8p with r = 1 + u in (j8.9j) gives 

"1 1 />oo 



:',2 
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1 



CO 



t a + p-l e -t J u a-l e -vi (}tl ) f ii 

i^V* / u a ~ x e~ u du \ dt 



r(a + /3) 
1 

~ r(a + /3) 

r(a)r(/3) 

r(a + /3) 

as desired. □ 

The Euler beta integral shows the following consistency relation: if u(x±, . . . , xn) in 
does not depend on x k , then 

u(g(t 1 ),...,g(t N ))d®!S 

Go 

N 





-u)) • 








-u))J 




X 







/ u(xi,...,x N ) Y[(x i+1 -Xif^ 1 u) 1 dx 1 ---dx N 

rxk+i 

U(X!, . . .,X N ) 

Y[( x i+i ~ XiY^+^'^^dxkdxi ■ ■ ■ dxk ■ ■ ■ dxN 

i=0 

I u(x 1 ,...,x N ) H (x^-Xif^-^- 1 

J S t\ t 1 L 



A? 



x / (x k - x fc _ 1 ) /3 (* fc -* fc - 1 )- 1 (x fc+1 - x k f^+^- l dx k d Xl ■■■dx k ---dx N 

= r(/3(* fc+1 - t fc _!)) n!^_i, fc - W) L N u{xu "' ,xn) 

x Yl {x l+ i- Xif^+^^dxx-'-dxN 

= I u(9(h),---,g(tk-i),g{tk+i),---,g(t N ))dQ%(g) 

JQo 

where Y*' N := {0 < x\ < ■ ■ ■ < x k -i < x k +i < • • • < x/y < 1} and u is u with the x k variable 
not written (because we u did not depend on it anyways). We used the Euler beta integral in 
the second to last line. 

Proof of Proposition HOI We bound d w ((f N )*l^N, (/ )*MAr+i) as follows. Let tin 6 V(Qo x 
Go) be defined by 



(g,h)dir N (g,h)= [ u(f N (x), f N+l (x))df, N+1 (x) 

JX(T N+1 ) 



U 

Go* Go 

where x = (x% ) and u £ C(<5o x {?())• We claim that this has marginals 

(f N )*f 1 N and (f N+1 )*/j,N+i. The second claim is obvious and the first follows from Euler's beta 
integral, stated in Lemma 18.61 Thus, 

d w ((f N )*m, (f N+1 )*m+i) 2 < I h- h\\ 2 L ^ N { g , h) 

J Go x Go 



\\f N (x)-f N+1 (x)\\ 2 L2 d m+1 (x). 

X T N+ l 
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We have thatB 



2 N+1 

2 iv-i„ 1 



2*-l 



< 



i=0 

1 



2 jv+i 

Inserting this into the above shows that (f N )*[i>N is a Cauchy sequence as claimed. □ 

Thus (f N )*HN — > <?o f° r some measure, which we suggestively write q$ G V(Go)- Now, it 
remains to show that On satisfies 



Proposition 8.7. The measure q$ satisfies (|8.4p . More precisely, 



(8.10) / u{g{t 1 ),...,g{t M ))dq%{g) 



r M 

/ u(xi,...,xjv) J\(xi+i - Xif^+^^dxi ■ ■ ■ dx M 



/or a// partitions {ti}^ =l and continuous functions u : R M — > R. Thus, Qq exists and is equal 
to q$. 

The proof of this is somewhat subtle. The main idea is that in order to show that q^ satisfies 
(|8.4p . for some fixed partition, it is enough to show it after we have added more points to the 
partition. This follows from Euler's beta integral, because the additional terms simply integrate 
out to give the desired integrand, assuming that u does not depend on these coordinates. Thus 
for a given partition < t\ < ■ ■ ■ < tM, we can consider the standard partition T^ d , for 
a very large N, and adjust the appropriate points so that they agree with the points of the 
partition we care about, {ti}. We then show that the pushforward measure associated to this 
partition converges to q$ weakly as N — > oo, which is done by constructing a rather complicated 
transport plan between this pushforward measure, and that associated to the standard partition 
(the intuition behind the plan is that we have changed only M of the elements of the partition, 
which is very small compared to N, the total number of elements, so in the transport plan, we 
should consider the unchanged partition elements to be the same in each, while a small number 
of elements that have changed do not affect things much). Because the adjusted partition 
satisfies (|8.4|) for all N, it is not hard to show that it satisfies it in the limit, showing the desired 
result. 

Proof. By Lemma 18.41 it is enough to show that for any partition < t\ < • • • < < 1, 

T(B) M 

q§(g tl G dx u . . .,g tM G dx M ) = M ]J(x i+1 - x, i ) p{u + 1 - u) - 1 dx l ■ ■ ■ dx M - 

lli=0 r (Pl^+l ~ ti)) i=0 

To show this, first fix the partition < t\ < ■ ■ ■ < tM < 1 and let 5 = sup <j<jv(*i+i ~~ U) and 
take any N so that 1/2 N < 5/2. By this choice, for each t{, there is a unique j(i) such that 

uem/^.m+^i^)- 



This alternatively follows from the calculation proving the Gromov-Hausdorff convergence of X(T^ d ) to Qo 
with g = f N+1 (x) compared to f N (g 1/2 N,. ■ • ,P( 2 «-i)/2«)- 
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Clearly, j(i) is strictly increasing and j(i + 1) > j(i) + 2. Now, we modify the "standard" 
partition T^ d by replacing j(i)/2 N by for each i G {1, . . . , M}. We call this new partition 

Rn- We define a coupling -ir between (f RN )*fJ>R N and (/ T ^ td )*/u T std by 



std >. 

jv 



for if G P(X({i?jv}) x X({T$ d }) given by 

u(x,y)dit(x,y) 



IX({R N })xX({T N }) 

= C l_u{x\x)\[{x i+1 -x i f' 2N - 1 

M 
i=0 

where S = : 1 < i < M} and x' is x with the terms replaced by j/j. Furthermore, we 
have written the normalizing constant as C where 



c 



and we are integrating over the set 

X = {(si, ...,XN,yi,.. .,Vm) ■ 0) G X(R N ),yi G C [0, 1]^+^". 

Again, by Euler's beta integral given in Lemma 18.61 it is not hard to see that ir has the desired 
moments. Given this, we have the bound 

d W ((f RN UR N ,(f TB "U T r) 2 ^ I \\g-h\\l 2 dn(g,h)= [ \\f R »(x>)-f T "(x)\\ 2 L2 dn(x,y). 
As before, we compute 

M 

\\i Rn W) - f T -{x)\\\ 2 = y^[(jh - - mm + (x iW+ i - vifam + 1)/2 - u)\ 

M 



2 2N 

i=l 

M 

This gives 



^ 4v X>* " J'(0/2) + + l)/2 " 



^((/^)^,(/ T -)^ Tjv ) 2 < 17 



22iV+l 



and thus (f RN )*fJ-R N — >■ weakly as iV — > oo. Finally, by Euler's beta integral, it is clear that 
for the original partition < t% < • • • < tM < 1, we have that 

{f RN )*^R N (9t 1 edx!,.. .,g tM G dx M ) 



n£o resell 



i=0 

/3 



Passing to the limit, this proves that satisfies (j8.4j) and thus by Lemma 18.41 Qq exists and 

Thus, we see that Qq exists and is uniquely specified by (|8.4|) . and that we have the conver- 
gence 

(X$ d ,d N , m )^(.Go,d L2 ,®$). 

□ 
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Now that we have constructed Qq, we will give some indication why one might expect 
that (£ ,d i2 ,Qo) admits some Ricci lower bound. The first is that in [38J, von Renesse 
and Sturm give a heuristic argument that the measure : = ^QjJ g V(Vq) (which makes 

Pq) into a metric measure isomorphism, so Ricci lower bounds on 
one space are equivalent to bounds on the other space) can formally be written in the form 

<flpj(/i) = J_ e -0E«*(A«l»O<flp o 

where cflPo is some "uniform" measure on Vq. Because we can think of Ent(/i|m) as a convex 
function on Vq by Proposition 16.41 as well a fact which we have not discussed, which is that 
multiplying a measure by e~ v for some convex function V can only improve Ricci bounds; cf. 
|33j Proposition 4.14. Thus, from this formal point of view, we expect that (Qq, d 1,2 , Qq) admits 
some Ricci lower bound. 

The second reason that one might expect such lower bounds is that von Renesse and Sturm 
have constructed a symmetric Dirichlet form in [35], given as the closure in L 2 (Qq), (E, D(E)), 
of the quadratic form 



E(F) := f \BF(g)\l 2{m) dQ^g) 



'Go 

with domain 

{F(g) = ip ((fi,g) L2im) , . . . , (f m , 0>r»([o,i])) ■m>l,i P e CKR^Jk 6 L 2 ([0, 1])} 

and where U)F(g) is the L 2 -Frechet derivative of F at g, which for F in the domain described 
above is 

m 

BF(g)(x) = J2dnp (</i,5) L 2([ 0i i]) , (fm, 9) m[o,l])) fi( x )- 
i=l 

The existence of such a Dirichlet form is interesting for various reasons (e.g. see |14| ) but in our 
case, it is relevant because Doring and Stannat have shown that E satisfies a Poincare inequality 

Theorem 8.8 ([12] Theorem 1.2). The Dirichlet form constructed in [38], E satisfies a Poincare 
inequality with constant less than \, i.e. for all F G D(K) 

Var Q ,(F) < 1e(F). 

as well clS cL log-Sobolev inequality 

Theorem 8.9 ([12] Theorem 1.4). There exists a constant C (independent of ' j3) such that for 
F G D(E) 

xfi' - - C 



f & F(g) 2 log | 


' F{9? ] 







p (g) < jE(F). 

Thus, while Theorems 18.11 and 18.21 certainly do not show that such inequalities imply lower 
Ricci bounds, they do suggest that (Qq, d L2 , Qq) has some Ricci lower bounds (in fact, examining 
these theorems, they suggest the tantalizingly simple result that the Ricci curvature is bounded 
from below by /?). Upon trying to prove this, the first indication that this might not be true is 
from the fact that the spaces (^^ d , d N , /Zjv) can be seen not to have any generalized Ricci lower 
bounds, by considering the function V, defined by (j,n = e~^Lebj 1 , 2 jv_i. It is then a relatively 
straightforward calculation to show that Hess V cannot be bounded from below as a quadratic 
form on X^ d , and by examining [33] Theorem 4.9, this shows there are no generalized Ricci 
lower bounds on (X^ d , d N , /j,n). This does not prove that (Oo,d L2 ,Qq) does not have lower 
Ricci bounds. 

However, we have managed to show that there are no lower Ricci bounds using intuition 
based on the fact that a Hilbert space (in our case L 2 ([0, 1])) equipped with a measure (in 
our case Qq) having nonnegative Ricci curvature must be log-concave (and in fact this is also 
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sufficient). Further details about this can be found in [21 Chapter 9]. These ideas allow us to 
prove 

Theorem 8.10. There is no K G R, /3 > such that (Go,d L2 , Qg) ^ as generalized Ric > K. 
Proof. Suppose otherwise, so there is some K such that Ric > K. Let, for s G (0, 1) 

A a := {geGo: g(s) > 1/2} 
B:={geg : g(s) > 0}. 

It is clear that 

(8.11) tA s + (1 - t)B = {g£G : g(s) > t/2} := C s (t). 

Notice that 

fe|,) ^ W rS-.)) ( n -^ 

In particular, QjJ(B) = 1, and for s G (0, 1) and t G [0, 1], Q§ (<?,(*)) > 0. Thus, define 

and by assumption, there is a geodesic fi(s)t between ^i(s) and Qq such that Ent(//(s)t|Qg) is 
i^-convex, as in the definition of generalized Ric > K. Because 

Ent(Ms)|«/) = / zJ—M log ( -J—XA S ) dQg = - log(QoVs)) < °c 
J Go <Q%(A S ) \%(A 8 ) J 

(and clearly Ent(QQ,Qg) = 0) so by assumptions of i^T-convexity, we must have that 

Ent(M(s)t|QS) < oo, 

in particular implying that n(s)t -C Qq. Thus, we can write /u(s)t = p(s)iQg, and we have that 



by (|8.1ip . we see that /i(s)t is supported on C s (tjl3> implying that 



J6o 



logp(s) t dyu(s) t 



log ( . 1 

log( ^[ S)t / log (®$(C s (t)))df,(s) t 

Go \dQo\c s (t)J J Go 

= Ent(M(fl) t |Q?|£7.( t )) " log (0S(C a (t)) 

>-io g foS(c s (t)) 



where Qq I cut) = ^ 1 zr Xc 3 <t)Qo- Combining this with the assumed if-convexity of the 
entropy functional along the path fi(s)t, we thus have that 

-log(<Q>J(C s (t))) < EntOxOOtlQ?) 



"^This follows from the fact that optimal transport maps mass along geodesies of the underlying space. This is 
intuitively obvious, as if not, we could move along a geodesic between the endpoints, reducing the total distance 
traveled, and it follows rigorously from 26 Proposition 2.10]. Thus, the support of /u(s)t is contained in U 7 7(t) 
where the union is over all geodesies 7 : [0, 1] — > Go with 7(0) G A s and 7(1) £ B. Because Go is a totally geodesic 
subset of a Hilbert space, we have that this union is just C s (i). 
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< (1 - t) Ent(M(s)|QS) + tEnt((Q>g|<Q>J) i(l - t)d w (f,(s), qJ) 2 . 

=0 



This implies that, because Ent(//(s)|<Q>o) = ~~ 1°S (^Qo (As 

(8.12) log (QS(C,(t))) > (1 - 1) log (OJC^)) + jt(l - Oj) 2 . 

Because diam(C?o) = 1) we must have that d w (fj,(s), Q' 3 ) G [0, 1], so rearranging f|8. 12[) and using 
this observation, we have that 

(8.13) log 

implying that for all s, t G (0, 1 

(8.14) -«ov~«vv/ > 

(QoV.)) 1 " 4 " 

We will show that for a fixed t G (0, 1) 

(8.15) lim Q |< C '"» =0, 
contradicting f)8. 14|) . To see this, note that 

QS(c.(t)) / r(/3) y /^^(l-x)^ 1 -)- 1 ^ 




3.16) 



oU)) 1 - Vr(^W(i- s )); (^^i^^M-i) 1 -'' 



It is not hard to see that because we have fixed t G (0, 1), all of the terms have finite and 
nonzero limits, except for r(/3s) which approaches oo, showing (|8.15p . and thus showing that 
there cannot be any generalized Ricci lower bounds on {Go, d 1,2 , Qq). □ 

8.3. Other Measures. The failure of the entropic measure to admit generalized Ricci bounds 
leads one to the following 

Question 8.11. Are there any reasonable measures on (Qq, d 1,2 ) with lower Ricci bounds? Here, 
"reasonable" is certainly open to interpretation, but one would at least want a measure supported 
on all of Qq. 

By repeating the proof of Theorem I8,10| and appropriate modification of Theorems 9.4.10 
and 9.4.11 in [2] (the stated theorems are given for K = 0, but the proof for general K goes 
through with basically no change) we have the following necessary and sufficient condition 

Theorem 8.12. For a probability measure M G V(Go), the space (Go,d L2 ,M.) has generalized 
Ric > K if and only ifWl is K -log-concave, in the sense that for open sets A,BgQo 

(8.17) log (M((l - t)A + tB)) > (1 - t) log (M(A)) + t log (M(B)) + — t(l - t)d w (M\ A , M\ B ) 2 

where := ^(A) X^M, and similarly for M|b, and d w is the Wasserstein distance on V(Go)- 
In fact, it is enough to check (|8.17p for the balls A = B r {f), B = B r (g) for all < r < ro for 
any ro > and f, g G Go- 

This seems to be a difficult condition to verify, but as in Theorem 18.101 it seems possible 
that it could rule out Ricci lower bounds for a given M. Finally, we mention that Theorem 
17.31 gives a possible method for constructing measures M G V(Go) with lower Ricci bounds, 
by constructing measures on Xn with lower Ricci bounds, whose pushforwards converge to 
M. This is certainly a good way to construct a measure with lower Ricci bounds, because for 
any sequence of measures M^r on Xffi, by Prokhorov's theorem, we can extract a subsequence 

such that (Xp,d N ,M N t) (Go,d L2 ,M) for some M G V(Go)- However, it is quite tricky 
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to choose the Mjy appropriately, so that the limit M is useable in any way. For example, it is 
easy to show that by taking normalized Lebesgue measure on Xfj d we have nonnegative Ricci 
curvature bounds, so some subsequence converges to a measure M E V(Go) with nonnegative 
Ricci curvature. However, we cannot find a reasonably nice description of this measure, and 
have not been able to prove any interesting properties about it, other than the fact that it has 
nonnegative Ricci curvature. For example, we are unable determine the support, which is quite 
unsettling, as if the support of M is not all of Go, then this could be a highly trivial statement 
(for example <5i £ 'P(^o) where 1 is the function t t— )■ 1 has nonnegative Ricci curvature, but 
this statement means absolutely nothing, as the only measure in ^(Go) which is absolutely 
continuous with respect to 5\ is 8\ itself.) 

As a final remark, we note that further exploration of this topic could be aided by the following 
description of geodesies in V(X) where X is a Hilbert spacecj We explain the notation in the 
subsequent paragraph. 

Theorem 8.13 ([2j Theorem 6.2.10). For X a separable Hilbert space, suppose that [i £ V\~{X), 
and v G V2(X) with bounded support, then there exists a locally Lipschitz, d 2 /2-concave, max- 
imal Kantorovich potential ip such that the unique geodesic in V2{X) between /i and v is given 
by (ji)*M; where 

r t (x) = x — tVip(x). 

In the above, T>2{X) is the set of probability measures fj, with f x \x — xo\ 2 dfi(x) < oo for some 
(and thus all) xq, "P£(X) is the set of "regular measures" with finite second moments, where a 
measure \i is "regular" if /u(-B) = for for all Borel set B with the property that 'j(B) = for 
any measure 7 £ V(X) such that 

for all functionals L 6 X* . (Such measures 7 are called Gaussian measures, and such sets B 
are called Gaussian null sets). We furthermore note that Vy? is the "Gateaux differential of ip," 
defined by 

tp(x + y) = ip(x) + (V(f{x), y) + o(\\y\\). 

Finally, the requirement that ip is a maximal Kantorovich potential means that (ip, ip c ) is a 
maximizing pair, achieving the value of 

sup <^ / tp(x)dfi(x) + / ip(y)dv(y) \ . 

(ip,i>)eC b (x)xC b (x) Ux Jx ) 

Thus, because Go C L 2 ([0, 1]) is a totally geodesic subset, this theorem gives a characterization 
of some of the geodesies in V(Go)- We note, however, that it seems rather difficult to check the 
regularity of a measure in V(Go), in the above sense, so this theorem may not actually be of 
much help in verifying displacement convexity for a chosen reference measure M. 

9. Conclusion 

In this section, we briefly mention in what ways the results presented in this paper are 
simplifications of the full theory. There is no need to restrict to compact base spaces, as 
there is a perfectly good notion of Wasserstein distance and optimal transport on the space 
of probability measures on a (complete) geodesic space X with finite second moments (i.e. 
J x d(xo , x) 2 dfi(x) < 00 for some xq G X). This allows us to extend the definition of generalized 

■^We have not discussed noncompact base spaces in our treatment of Wasserstein space, but the reader 
should be reassured that most things discussed above are true, and the biggest changes is that we must restrict 
to measures with finite second moment, J x d(x, xo) 2 dfi(x) < 00 for all xq £ X and the weak* topology is replaced 
by saying that ^ — ¥ y, if it converges in the dual topology of continuous bounded functions on X, i.e. Ct,(X)* 
and furthermore that f x d(xo, x) 2 dfik(x) —>■ J x d(xo,x) 2 dn(x). Anyways, as the reader will soon see, we will 
not actually be interested in geodesies in V(L 2 ([0, 1])), just of those in the totally geodesic subset V(Qo), which 
certainly fits in our framework discussed previously. See the conclusion (Section [9]l for further bibliographic 
references. 
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Ricci bounds to complete geodesic spaces, which may not be compact. In fact, this is preserved 
under the weaker notion of pointed Gromov-Hausdorff convergence, which is better suited to 
the noncompact setting. In another direction, it turns out that it is beneficial to consider n- 
dimensional manifolds equipped with measures of the form e~ v m (where m is the normalized 
volume measure), for some function V. In this case, it is advantageous to consider the iV- Ricci 
tensor for N G [1, oo] 

' Ric + Hess V N = oo 

Ric + Hess V — t^— (dV <g> dV) N G (n,oo) 
Ricn = < , 

j Ric + Hess V - oo{dV ® dV) N = n 

k — oo N < n 

In the N = oo case, it is not hard to generalize the result of Theorem 16.21 and show that weak 
a.c. displacement iC-convexity of Ent(-|e~ v rn) is equivalent to RiCoo > K. However, it turns 
out that there are displacement convexity characterizations of Ricjv > K as well, which we 
have not discussed. Lott and Villani, in [26 1 show that for N < oo, Ric at > is equivalent to 
displacement O-convexity of the Renyi functional, defined by 

S(pm\m) .= — I p~ x l N dm. 
Jx 

In |34j . Sturm discusses this notion, as well as discussing a notion of generalized RicAr > K 
for arbitrary N G [1, oo] and K G R, which is more involved. Sturm proves that under this 
assumption, the metric measure space satisfies the Bishop-Gromov volume inequality (cf. |34[ 
Theorem 2.3]), that such bounds are preserved under Gromov-Hausdorff convergence, as well 
as discussing analytic consequences of such bounds (and other, weaker variants). We note that 
in terms of relationship with Vo, these notions are not useful, however, because such generalized 
bounds imply that the underlying space has Hausdorff dimension less than N (cf. [34} Corollary 
2.5]). Discussion of all of these results can be found in great generality in Villani's book [37] . 

Finally, we note that in [35], Sturm has given a definition of the entropic measure over a 
general compact Riemannian manifold, denoted F@ G V(V(M)). We have not discussed this 
measure, because our discussion of Qq G V(Go) has heavily relied on the existence of the isometry 
vp : Go — > Vq. Such an isometry does not exist in higher dimension^, and thus we cannot make 
use of the Hilbert space setting that proved crucial in, e.g. the proof of Theorem l8.10l However, 
it seems possible that if one carefully examined the proof of Theorem 18. 101 it might be possible 
to translate it back into a proof entirely on Vq, which could then be generalized to show that 

Conjecture 9.1. For M a compact Riemannian manifold, and (3 > 0, using the higher di- 
mensional entropic measure 

p/3 e V{V{M)), as constructed in [35] as a background measure, 
(V(M), d w , P' 3 ) does not admit generalized Ric > K for any K G R. 

As in the one dimensional setting, the question of a natural "volume measure" in V{V{M)) 
with lower Ricci bounds certainly deserves further invesigation. 

Appendix A. Nonsmooth Change of Variables 

In this section, we briefly recall some facts about changing variables under a function that is 
only differentiable almost everywhere. This is particularly important to the theory of optimal 
transport on Riemannian manifolds, because by Theorem 16. 7\ we know that for fj,, v <C vol m, 
the unique geodesic between them is given by (F t )*fj, where F t (x) = exp x (—tV(p(x)) for 4> 
a ^--concave function. Thus, in general Ft is differentiable almost everywhere, and it will 
be important to understand the density of the pushforward measure (Ft)*fi. Our discussion 
below will mostly deal with functions on M ra , but by working locally, can be easily extended to 
manifolds. 



There is still a homeomorphism from Q — > V(M) given by g h- > g*m, where Q is the set of functions on M 
of the form exp x ( — Vtp) for ip a d 2 /2-concave tp, but this is not an isomorphism for dimension n > 1. 
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To begin with, we recall that by the area formula, proven in e.g. |13[ Section 3.2.3]. For 
/ : M n — > W 1 a Lipschitz function, and h : R n — > [0, oo], a nonnegative Borel function, we have 
that 

(A.l) f h(x)\detVf(x)\dx = [ V u{x)dy. 

The following lemma and proof comes from [2J Section 5.5]. 

Lemma A.l. Suppose that p G L 1 (M n ) a nonnegative function with J Rn p(x)dx = 1. Further- 
more, suppose f : M. n — > W 1 is a Lipchitz function such that there is a Borel set A C M. n such 
that {p > 0} \ A has Leb-measure zero and /|a is injective. Then /*(pLeb) <C Leb if and only 
if | det V/| > Leb-a.e. on A, and in this case 



(A.2) /*(pLeb) 



detV/| 



(/U) 



-1 



Leb. 

f{A) 



Proof. If | det V/(x)| > for a.e. x G then from the area formula (jA.ip with 

, PX(/| A )-i(j?)nA 
|detV/| 

for B C M d an arbitrary Borel set, then 



/»(pLeb)(£)= / 

'(/U)- 1 ^) 

p(x)dx 

(/U)-i(B)nA 

det V/(cc)|dx 

rj /L,i ° (/U) _1 (y)dy, 

sn/(A) I det V/ 1 
as desired. 

Conversely, if there is a Borel set -B with Leb(B) > but | det V/(x)| = for x £ B, then 
the area formula (jA.lj) . with h = xb, gives 

0= f \detV f(x)\dx= [ xb o (f\ A r 1 (y)dy = Leb(f\ A (B)). 

However, we have that 

/*(/9Leb)(/U(-B)) = / p(x)dx > / p{x)dx = I p(x)dx > 

y/-i(/U(B)) Jf-i(f\ A (AnB)) Jahb 

because Leb(A n B n {p > 0}) = Leb(# n {p > 0}) > 0. This shows that /*(pLeb) is not 
absolutely continuous with respect to Lebesgue measure. □ 

This easily gives the following ( [37] , Chapter 1 , "change of variables formula" ) 

Lemma A.2. For M a n-dimensional (compact) Riemannian manifold, with m = v ™'^ the 
normalized volume measure, if /Uq,//i <C m with pi = pim and F : M —> M is Lipschitz 
and almost everywhere injective (i.e. when restricted to a set of full measure is injective) with 
F*Po = Ml) then 

po(x) = pi(F(x)) JacF. 
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Appendix B. A Brief Overview of Riemannian Geometry and Curvature 

B.l. Riemannian Metrics and Connections. In this section, we give an incredibly brief 
overview of the basics of Riemannian geometry needed to understand the ideas presented in 
this text. The material presented here is certainly not enough for an appropriate introduction 
to the vast subject of Riemannian geometry. The interested reader could consult any of [29], |15j . 
[28] . or [21], for a proper introduction. Suppose that M is a smooth manifold F^l A Riemannian 
metric g is a (smooth) symmetric nondegenerate bilinear form on the tangent space TM. By 
this, we mean that for p G M, g p is an inner product on the vector space T p M and for X and 
Y any smooth vector fields defined in the neighborhood of p, the function g(X, Y) defined by 
g(X, Y)\ q = g q {X q ,Y q ) is smooth in some neighborhood of p. We write (M,g) for a smooth 
manifold M equipped with a metric g. The Riemannian metric allows us to define a preferred 
notion of directional derivative on (M,g), called the Levi-Civita connection. This is a special 
choice of affine connection, which is a map (writing T(TM) for smooth vector fields) 

V : T(TM) x TiTM) T(TM), (X, Y) i-> V X Y 

obeying the following properties: 

(1) It is linear over C°° (i.e. tensorial) in the first slot. That is, for f,g G C°°(M), and 
X,Y,Z G T(TM) 

V fX+gZ Y = fVxY + gV z Y. 

(2) It obeys the Leibniz rule and is linear over R in the second slot. That is, for / G C°°, 
a, (3 G R and X,Y,Z £ T{TM) 

V x (aY + (3Z) = aV x Y + f3V x Z 

V x (fY) = fV x Y + X(f)Y. 

The Levi-Civita connection is the unique affine connection which is torsion free and compatible 
with the metric g. Being torsion free means that for X,Y £ T(TM) 

(B.l) V X Y- V Y X= [X,Y] 

and compatibility with the metric means that for X,Y, Z G T(TM) 

(B.2) X(g(Y, Z)) = g(V x Y, Z) + g(Y, V X Z). 

To see that the Levi-Civita connection is unique, it is possible to use (|B.ip and (jB.2[) to get an 
expression in local coordinates for what the connection must be, but on the other hand, it is 
possible to see that this gives a connection satisfying (|B.ip and (|B.2|) . 

A connection also allows us to define the notion of the covariant derivative along a curve 7 
which is a R-linear map V — > DV/dt for V a vector field along 7 satisfying the Leibniz rule 

and it is induced by the connection: if V is extendible to V, a vector field in a neighborhood of 
7, then 

DV 



VyV. 

dt 7 

For a curve j(t) and a vector V G T^^M, it turns out that there is a unique vector field along 
7, V such that 

DV 

nr = - 

and V"(0) = V. In this case, we say that V has been parallely transported along 7. 

Covariant differentiation also allows us to give a nice description of geodesies which are locally 
length minimizing curves (notice that this is slightly different than our definition of geodesies 



^For an introduction to smooth manifolds, see [22] 
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in a metric space, which we required to be globally length minimizing). It can be shown that a 
curve 7 : [0, 1] — > M is a geodesic if and only if 

Vjj = 0. 

Assuming that (M, g) is a compact manifold (in the topology determined by the induced length 
metric coming from the definition of the length of a smooth curve being L(j) := f$ \j\ g dt), one 
can show that geodesies exist for all time, and are uniquely determined by their initial position 
and velocity. This allows us to define the exponential map exp p (A), to be 7(1), where 7(f) is 
the unique geodesic starting at p E M with initial tangent vector X E T p M. All of the following 
works (sometimes with slight modification, because in some non-compact manifolds, geodesies 
need not exist for all time) for non-compact manifolds, but since all we consider in the above is 
compact manifolds, we will assume that (M,g) is compact. 

B.2. Curvature. The Levi-Civita connection allows us to then define the Riemann curvature 
tensor 

(B.3) Riem(JT, Y)Z = V x ^yZ - V y V x Z - V [x ,y] z - 

This tensor is very complicated, so it is beneficial to consider the following tensor and scalar, 
the Ricci tensor defined as (for X, Y E T p M) 

n 

(B.4) Ric(X,Y) := ^(Riem(X, Y) 

i=l 

and scalar curvature 

n 

(B.5) S := g&iemiE^EfiE^Ei) 

where Ei is an orthonormal basis for the tangent space at the point of interest (it is not hard 
to see that this is independent of the choice of orthonormal basis). Finally, for any two g- 
orthogonal vectors X, Y E T p M, we define the sectional curvature of the plane spanned by X 
and Y to be 

(B.6) sec(X, Y) := 5 (Riem(X, Y)Y, X). 

It turns out that these are much more manageable things to study and allow for simpler geomet- 
ric interpretation than the full Ricci tensor. We give some geometric interpretations of sectional 
curvature bounds and Ricci curvature bounds in Sections and [SJ additionally discussing Ricci 
bounds in AppendixO We will often write Ric > K or sec > K to mean Ric(X, Y) > Kg(X, Y) 
or sec(X, Y) > Kg(X, Y) respectively. 

B.3. Jacobi Fields. Examining the proof of Lemma 16.81 it is apparent that in the theory of 
optimal transport, a type of vector fields known as Jacobi fields are the link between geometrical 
properties and Ricci curvature. The classical reference for these is Milnor's book [28j^j, but any 
of the books mentioned in the introduction to this appendix should also more than suffice as 
references. A Jacobi field arises from a variation through geodesies, which is a smooth map 

(B.7) a : (— e, e) x [0, 1] — )• M 

so that for a fixed s E (— e, e), 1 1 — >■ a(s, t) is a geodesic. From the map a, we get two covariant 
derivatives D/ds and D/dt for vector fields along a, i.e. V : (— e,e) x [0,1] —> TM such that 
V s> t E T Q ( Sjt )M. Using the definition of Riemannian curvature, (|B.3p . and computing in local 
coordinates, it is possible to show (cf. |28|, Lemma 9.2]) that for a vector field V along a 

(B.8 — — V-— — F = Riem — ,— )V. 

1 ' dsdt dt ds Xds'dtJ 



9ft 

N.B. our conventions about the sign of the curvature tensor are different from Milnor's. 
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Setting V = da/dt, the tangent vector along the geodesies, (|B.8[) gives that 

D D da D D da ( da da\ da 

( } TsJt~di ~ JtTs~bl = Vd?dtjdt" 

Because a is a variation through geodesies, we have that 

— — - () 

di~dt ~ 

and by the torsion free property of the connection, it is not hard to see that we have 

D da D da 
ds dt dt ds 

Combining these equations 

D D da _ . f da da\ da 
— — — + Riem — , — — = 0. 



dt dt ds \ds dt J dt 

Evaluating this at s = and letting J = da/ds\ s= o be the vector field along the geodesic a(0, t), 
we have 

,„ D D „ / da\ da 

This is known as Jacobi's equation and any J satisfying it is called a Jacobi field. Writing (|B.10|) 
in local coordinates, it is possible to show that this is a linear second order ODE, and thus it 
has a unique solution existing as long as 7 is defined, as long as we specify J(0) and J(0) (where 
J = DJ/dt). It is possible to show that any J along a geodesic 7 satisfying (|B.10|) comes from 
some variation through geodesies (see [28} Lemma 14.4]). 

The reason which we are interested in Jacobi fields is that they allow us to compute the 
derivative of the exponential map, as follows. Notice that exp p : T p —> M, so the derivative is 
a map dexp p : T(T p M) —> TM. Because T p M is a vector space, for V £ T p M, Ty(T p M) is 
naturally identified with T p M. For W S Ty(T p M), we have that 

dexpJW) = 4" expJV + sW). 
v ds s=o F 

However, notice that defining 

(B.ll) a{s,t) =exp p (t(V + sW)), 

this is a variation through geodesies, so 

: - da , 

J(t) = ^(o,t) 

is a Jacobi field along ^(t) = exp p (tV) with J(l) = dexp p (W). Furthermore, notice that 
because it is a Jacobi field, J(t) is uniquely described by its initial value and covariant derivative. 
Examining the definition of a, (jB.lip . it is easy to see that J(0) = and J(0) = W (regarded 
as an element of T p M). Thus dexp p (W) = J (I), where J is the unique Jacobi field with these 
initial conditions. A similar calculation, in a more specialized case is done in the proof of Lemma 
16.81 There, we calculate the derivative of a map of the form y \-t ex.p y (X(y)) where X(y) is a 
vector field on M , which follows in roughly the same manner as the above. 



Appendix C. Manifolds with Lower Ricci Bounds 

In this section we give an overview of consequences of Ricci lower bounds in the classical 
sense, for Riemannian manifolds. Recall that if (M, g) is a Riemannian manifold, we say that it 
has Ricci curvature bounded below by K £l (which we will write Ric > K) if Ric(£,£) > -K'ICI 2 
for all f G TM. 



OTIS CHODOSH 



C.l. Topological and Geometric Consequences. It is natural to ask what sort of topolog- 
ical conditions must be met so that a manifold admits a metric of Ricci bounded below by K. 
If K > 0, we have a classical theorem giving strong topological and geometric restrictions on 
the manifold 

Theorem C.l (Bonnet- Myers). For a n- dimensional Riemannian manifold (M,g), if Ric > 
(n — 1)K > then M has a finite fundamental group, i.e. \tt\{M)\ < oo, and 

diam(M, 5 ) < 

\J K 

The standard proof uses Jacobi fields to study minimizing geodesies, and can be found in 
[291 Chapter 4]. If K = 0, then the example (M n ,d]Rn) shows that (M,g) need not be compact, 
but there is the following theorem which shows that this is in some sense essentially the only 
non-compact (simply connected) example. 

Theorem C.2 (Cheeger-Gromoll). If (M,g) is a Riemannian manifold with Ric > 0, then the 
universal cover, equipped with the pullback metric (M,g) is isometric to the product (N,g^) x 
(U. g ,g^q) for some compact (N,gw) and q > 0. 

Even negative Ricci lower bounds place topological restrictions on the underlying manifold, 
as the following theorem shows 

Theorem C.3 (Gallot-Gromov). For (M,g) a Riemannian manifold of dimension n and 
diam(M) < D with Ric > (n — 1)K, then there is a constant C = C (n, KD 2 ^j such that 

h(M) < c, 

where b\(M) is the first Betti number, i.e. the dimension of the homology group Hi(M,M). 
Moreover, there is e(n) > such that if KD 2 > — e(n) then 

h(M) < n. 

Thus, we see that we have a good amount of topological control on manifolds admitting 
metrics with "almost nonnegative" Ricci curvature. Proofs of the above two propositions can 
be found in [29|, Chapter 9]. A good overview of these theorems and more information about 
Ricci lower bounds can be found in [39J. 

We now give the first indication that Ricci curvature bounds are linked to volume measure- 
ments. It turns out that lower Ricci bounds give control over the rate at which the volume of 
balls change with the radius compared to the rate in the standard spaces of constant curvature. 

Theorem C.4 (Bishop-Gromov Inequality). If (M,g) satisfies Ric > (n — 1)K then for all 
p G M , the map 

R ^ vo\ m (Br(p)) 

VOl S n(B R ) 

is nonincreasing for R G (0, oo), where vols^(Bji) is the volume of a ball of radius R in the 
simply connected space form of constant sectional curvature K . 

This plays a crucial role in the proof of Theorem 17.51 which says that the set of manifolds 
with fixed dimension, upper bounds on diameter and lower bounds on Ricci curvature is pre- 
compact in the Gromov-Hausdorff topology. This is because it allows us to uniformly bound 
the number of disjoint e-balls for such manifolds, which it turns out is a sufficient condition of 
precompactness, by a Theorem proven by Gromov (cf. Theorem 7.4.15] for a easy to follow 
proof). 

C.2. Gromov HausdorfF Limits of Bounded Ricci Manifolds. Theorem [73] thus leads to 
the natural question of whether or not we can identify any properties of metric spaces which are 
the limit of manifolds with lower Ricci bounds. Theorem 17.31 which says that generalized Ricci 
bounds are preserved under Gromov-Hausdorff convergence, combined with Theorem l6.21 which 
says that Riemannian manifolds with lower Ricci bounds also have the same generalized lower 
bounds, show that the limiting metric spaces have generalized Ricci lower bounds. However, as 
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we will explain below, it turns out that not all spaces with generalized Ricci lower bounds are 
Gromov-Hausdorff limits of manifolds with Ricci lower bounds. As such, we expect that limits 
of manifolds with lower Ricci bounds enjoy a richer structure than spaces with generalized lower 
Ricci bounds. The study of such spaces was basically started by Cheeger and Colding, in their 
papers [61 [8]. Most of their results are somewhat out of the range of this exposition, but we 
mention a few. They showed that if the Hausdorff dimension of the limiting space is the same as 
the limit manifolds, then except for a set of small codimension, the limiting space is bi-H61der 
equivalent to a smooth manifold. Furthermore, in this case, they showed that the isometry 
group of the limit space is a Lie group. Even if the dimension of the limit decreases (they show 
that it cannot increase), they are able to define a Laplacian and show that eigenvalues and 
eigenfunctions pass to the limit under the Gromov-Hausdorff convergence in some sense. 

One thing they proved is that the limit spaces of manifolds with nonnegative Ricci curvature 
obey a similar sort of splitting theorem as in Theorem lC.2l We do not give the details, but only 
remark that this shows that (1", || • ||) with || • || any Banach norm which is not the standard 
euclidean norm, cannot be a limit of Ricci nonnegative manifolds (because this space cannot 
split). However, it can be seen that this space has nonnegative generalized Ricci curvature, so 
as we claimed above, not all spaces with generalized Ricci lower bounds are limits of manifolds 
with Ricci lower bounds. 

C.3. Upper Ricci Bounds. To finish this section, we make a few remarks on upper Ricci 
bounds. It is not unreasonable to wonder if a notion of displacement concavity is a fruitful 
notion, possibly generalizing upper Ricci bounds. In fact it is really not, for a few reasons. 
First of all, Proposition 16.31 shows that Ent(-|Leb) is not displacement concave on "P([0, 1]), 
even though in this case, Ric = 0. Secondly, it seems unlikely that the notion of displacement 
concavity will be preserved under Gromov-Hausdorff convergence, because examining the proof 
of Theorem 17.31 we see that we would need equality in (|7.2[) . i.e. 

Ent(fi|^) = lim Ent(z>| n ^| / u n ), 

n— >oo 

which from the proof seems unlikely to be true. Finally, we remark that the theory of lower Ricci 
bounds is far richer and more interesting than that of upper Ricci bounds As an example of 
this, we mention three theorems of Lohkamp. 

Theorem C.5 (|23j, Corollary B). Every smooth compact manifold M n of dimension n > 3 
admits a metric with negative Ricci curvature, Ric < 0. 

Theorem C.6 ([25J, Theorem 2). For a fixed smooth manifold M n , any V £ (0,oo), and any 
sequence of positive real numbers = Ao < Ai < A2 < • ■ ■ , there is a sequence of metrics g m on 
M such that 

(1) The k-th eigenvalue of the Laplacian, A gm is A& for k < m. 

(2) The volumes are constant, i.e. vo\(M,g m ) = V. 

(3) The Ricci curvature is bounded from above by —m 2 , i.e. Ric 9m < —in 2 . 

Theorem C.7 ( |24j Theorem B). For a fixed a £M and smooth compact manifold M, the set 
of Riemannian metrics on M with Ric < a is dense in the set of Riemannian metrics on M in 
the C° topology, and in the Gromov-Hausdorff topology. That is, for any metric on M, g, we 
can find g m with Ric 9m < a and where the components of g m converge in the sup-norm to those 
of g, and additionally metrics gk such that Ric 9fc < a and 

(M,g k ) ^ (M,g). 

The first two theorems shows that the existence of a Ricci-negative metric gives no topological 
information in direct contrast with the Ricci-positive case. The second theorem, puts further 
restrictions on the implications of upper Ricci bounds on the eigenvalues of the Laplacian. 
However, the third is probably the most striking, showing that that Ricci upper bounds cannot 
be preserved under Gromov-Hausdorff convergence (in contrast to lower bounds). 



Two sided bounds on Ricci curvature are studied, however; cf. the remarks in Chapter 6 of [39] 
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